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Abstract. We consider the perturbed mKdV equation <9 t u = —d x (d 2 u + u 3 — 
b(x, t)u) where the potential b(x, t) — bo(hx, ht), < h -C 1, is slowly varying with 
a double soliton initial data. On a dynamically interesting time scale the solution 
is 0(h 2 ) close in H 2 to a double soliton whose position and scale parameters follow 
an effective dynamics, a simple system of ordinary differential equations. These 
equations are formally obtained as Hamilton's equations for the restriction of the 
mKdV Hamiltonian to the submanifold of solitons. The interplay between algebraic 
aspects of complete integrability of the unperturbed equation and the analytic ideas 
related to soliton stability is central in the proof. 



1. Introduction 

We consider 2-soliton solutions of the modified KdV equation with a slowly varying 



external potential (1.1 ). The purpose of the paper is to find minimal exact effective dy- 
namics valid for a long time in the semiclassical sense and describing non-perturbative 
2-soliton interaction. In standard quantum mechanics the natural long time for which 
the semiclassical approximation is valid is the Ehrenfest time, log(l /h)/h - see for 
instance [7]. The semiclassical parameter, h, quantifies the slowly varying nature of 
the potential. 

Unlike in the case of single-particle semiclassical dynamics, that is, for the linear 
Schrodinger equation with a slowly varying potential, the exact effective dynamics 
valid for such a long time requires /z 2 -size corrections^} Those corrections appeared as 
unspecified 0{h 2 ) additions to Newton's equations (which give the usual semiclassi- 
cal approximation) in the work of Frohlich-Gustafson-Jonsson-Sigal [13j on 1-soliton 
propagation. That paper and its symplectic point of view were the starting point for 
[E1CL9]. 

Following the 1-soliton analysis of [HJ EE] the semiclassical dynamics for 2-solitons 
considered here is obtained by restricting the Hamiltonian to the symplectic manifold 
of 2-solitons and considering the finite dimensional dynamics there. The numerical 
experiments [TT] show a remarkable agreement with the theorem below. However, 
they also reveal an interesting scenario not covered by our theorem: the velocities of 



^A compensation for that comes however at having the semiclassical propagation accurate for 
larger values of h. 
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FIGURE 1 . A gallery of numerical experiments showing agreement with 
the results of the main theorem (clockwise from the left hand corner) 
for the external fields listed in (1.15) with the indicated initial data. 
The continuous lines are the numerically computed solutions and the 
dotted lines follow the evolution given by (1.4). The main theorem does 
not apply to the bottom two figures on the whole interval of time due 
to the crossing of c/s - see Fig|3j In the first figure in the second line, 
(1.4) still apply directy, but in the second one further modification is 
needed to account for the signs. 



the solitons can almost cross within exponentially small width in h and the effective 
dynamics remains valid. Any long time analysis involving multiple interactions of 
solitons has to explain this avoided crossing which perhaps could be replaced by a 
direct crossing in a different parametrization. This seems the most immediate open 
problem of phenomeno logical interest. 
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The effective dynamics follows a long tradition of the use of modulation parameters 
in soliton propagation - see for instance [E] , [2S] , [2E] , [30] , [32] and the numerous refer- 
ences given there. For non-linear dispersive equations with non-constant coefficients 
one can consult, in addition to [13], [3] , [E] , [IS] , [22] , and references given there. 

Here we avoid generality and, as described above, the aims are more modest: for the 
physically relevant cubic non-linearity we benefit from the completely integrable struc- 
ture and using classical methods we can give a remarkably accurate and phenomeno- 
logically relevant description of 2-soliton interaction. The equation (1.1 ) shares many 
features with the dynamical Gross-Pitaevskii equation, 

idfU = —d 2 u — |m| 2 u + V(x)u , 

but is easier to study, mathematically and numerically. In a recent numerical study 
Potter [31] showed that the same effective dynamics applies very well to iV-soliton 
trains in the case of perturbed mKdV and NLS. The soliton matter- wave trains cre- 
ated for Bose-Einstein condensates [33] were a good testing ground and our effective 
dynamics gives an alternative explanation of the observed phenomena. At the mo- 
ment it is not clear how to obtain exact effective dynamics for the perturbed NLS. 
To state the exact result we recall the perturbed mKdV equation [ID]. [11]: 

d t u = -d x {d 2 x u - b(x, t)u + 2u 3 ) , 

( Li> b(x,t) = bo(hx,ht) , 0</i<l, d a b G L°°(M 2 ) . 

For b = the equation is completely integrable and has a special class of iV-soliton 
solutions, g7v(x, a, c), a G M. N , c G M. N - see £ 1.1 and q3melow. For iV = 2 we obtain 



Theorem. Let 5o > and a, c G W 1 . Suppose that u(x,t) solves (1.1) with 

(1.2) u(x, 0) = q 2 (x, a, c) , \c% ± c 2 \ > 25 > , 25 < \c~j\ < (2<5 ) _1 • 
Then, for t < T(h)/h, 

(1.3) \\u(-,t) - q 2 (;a(t),c(t))\\ H2 < Ch 2 e Cht , C = C(5 , b ) > , 
where a(t) and c(t) evolve according to the effective equations of motion, 



,2 



a; = c„- 



sgn(cj)d Cj B(a, c, t) dj = sga(cj)d aj B(a, c, t) 



3 ~ 

^ B(a, c, t) = f - / b(x,t)q 2 (x, a, c) 2 dx . 



The upper bound T{h)/h for the validity of (1.3) is given in terms of 

(1.5) T(h) = min(<5 log(l//i), T (h)) , 5 = 5(5 , b ) > 

where for t < T (h)/h, \ci(t) ± c 2 (t)\ > 5o > and So < \cj(t)\ < 5 1 . Under the 
assumption (1.2) on c, T (h) > 5 2 , where 5 2 = 5 2 (So,bo) > is independent of h - see 
(p2l. 
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Remarks. 1. We expect the same result to be true for all N with H 2 replaced by 
H N . For N — 1 it follows directly from the arguments of [19J. That case is also 
implicit in this paper: single soliton dynamics describes the propagation away from 
the interaction region. 

2. The Ehrenfest time bound, T(h) < Slog(l/h), is probably optimal if we insist on 
the agreement with classical equations of motion (1.4). We expect that the solution is 
close to a soliton profile q%(x, a, c) for much longer times (h~°°7) but with a modified 
evolution for the parameters. One difficulty is the lack of a good description of the 
long time behaviour of time dependent linearized evolution with b present - see §[8] 
However, the modified equations would lack the transparency of (1.4) and would be 



harder to implement. The numerical study [31] suggests that for the minimal exact 
dynamics the error bound 0(h 2 ) in (1.3) is optimal. 

3. As shown by the top two plots in Fig jl] the agreement of the approximations given 
by (1.4) and numerical solutions of ( |1.1[ ) is remarkable. The codes are available at 
|17j . see also £1.4 Experiments support the preceding remark. 

4. The condition that \ci(t) ± c 2 (t)| > Si, that is, that the perturbed effective 
dynamics avoids the lines shown in Fig|2j could most likely be relaxed. Allowing that 
provides more interesting dynamics as then the solitons can interact multiple times. 



As discussed in §1.2 and Appendix [Bj we expect avoided crossing after ±Cj(£)'s get 
within exp(— c/h) of each other - see Figj3} Examples of such evolution, and the 
comparisons with effective dynamics, are shown in the lower two plots in Fig|TJ On 
closer inspection the agreement between the solutions and solitons moving according 
to effective dynamics is not as dramatic as in the case when ic/s stay away from 
each other but for smaller values of h the result should still hold. We concentrated 
on the simpler case at this early stage. 

5. The equation (1.1 ) is globally well-posed in H k , k > 1 under even milder regularity 
hypotheses on b. This can be shown by modifying the techniques of Kenig-Ponce- 
Vega [21j - see Appendix [Aj Although for k > 2 more classical methods are available, 
we opt for a self-contained treatment dealing with all if fc 's at once. 

6. Studies of single solitons for perturbed KdV, mKdV, and their generalizations 
were conducted by Dejak-Jonsson [TO] and Dejak-Sigal [II] . The perturbative terms, 
b(x,t), were assumed to be not only slow varying but also small in size. The mKdV 
results of [10] are improved by following [19] . For KdV one does not expect the same 



behaviour as for mKdV and the O (h 2 )- approximation similar to (1.3) is not valid - 
see [28| H6] for finer analysis of that case. 

7. The conditions that u(x,0) = q2(x,d,c) can be relaxed by allowing a small per- 
turbation in H 2 - see [9] for the adaptation of [19] to that case. Similar statements 
are possible here but we prefer the simpler formulation both in the statement of the 
theorem and in the proofs. 
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Figure 2. On the left we show IR 2 \ C and on the right examples 
of double solitons corresponding to (ci,c 2 ) indicated on the left (with 
a i = a 2 = in the first figure and a 2 = —a\ = 1, in the other two). At 
the coordinate axes the double soliton degenerates into a single soliton. 
As one approaches the lines c\ = ±c 2 the solitons escape to infinities 
in opposite direction. 



In the remainder of the introduction we will explain the origins of the effective 



dynamics (1.4), outline the proof, and comment on numerical experiments. 



1.1. Double solitons for mKdV. The single soliton solutions to mKdV, (1.1) with 
6 = 0, are described in terms of the profile r](x, a, c) as follows. Let r](x) = sechx so 
that — T) + rj" + 2rj 3 = 0, and let r](x, c, a) = cr](c(x — a)) for a G M, c G IR \ 0. Then a 
single soliton defined by 

u(x, t) = i](x, a + c 2 t, c) 

is easily verified to be an exact solution to mKdV. Such solitary wave solutions are 
available for many nonlinear evolution equations. However, mKdV has richer struc- 
ture - it is completely integrable and can be studied using the inverse scattering 
method (Miura [27] . Wadati [35]). One of the consequences is the availability of 
larger families of explicit solutions. In the case of mKdV, we have N -solitons and 
breathers. In this paper we confine our attention to the 2-soliton (or double soli- 



ton), which is described by the profile q2(x, a, c) defined in (3.2) below. The four real 
parameters, a G M 2 , and c G IR 2 \ C, 

C = {(ci, c 2 ) : ci = ±c 2 } U R x {0} U {0} x R , 



describe the position (a) and scale (c) of the double soliton. At the diagonal lines the 
parametrization degenerates: for c\ = ±c 2 , g 2 = 0. At the coordinate axes in the c 
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space, we recover single solitons: 

q 2 (x, a, (ci, 0)) = -ci77(x, a 1; c x ) , <? 2 (x, a, (0, c 2 )) = c 2 r?(x, a 2 , c 2 ) . 

Fig|2] shows a few examples. 

Solving mKdV with w(x, 0) = q 2 (x, a, c) gives the solution 

u(x, t) = q 2 (x, Oi + tei, a 2 + £c 2 , c) , 

that is, the double soliton solution. 

If, say, < ci < c 2 , then for |oi — a 2 | large, 

a, c) ss ?7(x, a x + «i, Ci) + 77(2;, a 2 + a 2 , c 2 ) 



where aj are shifts defined in terms of c, see Lemma 3.2 for the precise statement. 
This means that for large positive and negative times the evolving double soliton is 
effectively a sum of single solitons. The decomposition can be made exact preserving 
the particle-like nature of single solitons even during the interaction - see (3.11) and 
Fig§ 

We consider the set of 2-solitons as a submanifold of i/ 2 (IR;IR) with 8 open com- 
ponents corresponding to the components of 1R 2 \ C: 

(1.6) M = { q(-, a,c)\a = (a 1; a 2 ) e M 2 , c = (c u c 2 ) G M 2 \ C } . 

As in the case of single solitons this submanifold is symplectic with respect to the 
natural structure recalled in the next subsection. 

1.2. Dynamical structure and effective equations of motion. The equation 
( 1.1[ ) is a Hamiltonian equation of evolution for 

(1.7) H b (u) = l - J(u 2 x -u A + bu 2 )dx , 

on the Schwartz space, «S(R; M.) equipped with the symplectic form 



1 



1.8) oj(u, v) = - / / (u(x)v(y) — u(y)v(x))dydx . 



In other words, (1.1) is equivalent to 

(1.9) Ut = d x H' b (u), (H' b {u)^) = ^-H b {u + s V )\ s= ^ 

and d x H' b {u) is the Hamilton vector field of H b) H^ b , with respect to lo: 

u(<p,Z Hb (u)) = (H' b (u),(p) • 



For 6 = 0, 'Ehq is tangent to the manifold of solitons (1.6). Also, M is symplectic 
with respect to u>, that is, u is nondegenerate on T U M, u G M. Using the stability 
theory for 2-solitons based on the work of Maddocks- Sachs [21] , and energy methods 
(enhanced and simplified using algebraic identities coming from complete integrability 
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of mKdV) we will show that the solution to (1.1) with initial data on M stays close 
to M for t < \og(l/h)/h. 

A basic intuition coming from symplectic geometry then indicates that u(t) stays 

of 



close to an integral curve on M of the Hamilton vector field (defined using uj\ 
H b restricted to M: 

def 1 f 

H eB (a,c) = H b \ M (a,c) = H \ M (a,c) + - / b(x)q 2 (x, a, cfdx , 



(1.10) 



H \ M (a,c) 



:(\ci\ 



\ C 2\ 



u\m = dax A d\ci \ + da 2 A d\c 2 \ , 

2 

-h cS = ^2 sgn(c j )(<9 aj if cff d Cj - d c .H eS d aj 

3=1 



The effective equations of motion (1.4) follow. This simple but crucial observation 



was made in [18], [19] and it did not seem to be present in earlier mathematical work 
on solitons in external fields [13] . 

The condition made in the theorem, that \ci(t) ± c 2 (t)\ and \cj(t)\ are bounded 
away from zero for t < T (h)/h (where T (h) could be oo), follows from a condition 
involving a simpler system of decoupled /i-independent ODEs - see Appendix [B] Here 
we state a condition which gives an /^-independent T appearing in (1.5). 

Suppose we are given b(x,t) = bo(hx,ht) in (1.1) and the initial condition is given 
by q 2 (x,a,c), a = (di,a 2 ), c = (ci,c 2 ), \c\ ± c 2 | > 5 , \cj\ > S , We consider an 
^-independent system of two decoupled differential equations for 

A(T) = (Ai(T), A 2 (T)) , C(T) = (Ci(T), C 2 (T)) , 

given by 



d T A 3 =C]-b (A 3 ,T) 
d T Cj = Cjd x b (Aj,T) 



A(0) = ah, (7(0) 



J = 1,2. 



Then, for a given 5i < 5o, T (h) in (1.5) can be replaced by 

def 



(1.12) 



T = sup{T : \C X (T)±C 2 {T)\>8 U \C j (T)\>5 1 , j = 1,2} . 



1.3. Outline of the proof. To obtain the effective dynamics we follow a long tra- 
dition (see [13] and references given there) and define the modulation parameters 

a(t) = (a x (t), a 2 (t)) , c(t) = ( Cl (t), c 2 (t)) , 

be demanding that 



v(x, t) = u(x, t) - q(x, a(t), c(t)) , q = q 2 , 
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Figure 3. The plots of c and a for the external potential given by 
the last b(x,t) in (1.15), and c = (6,10), a = (—1,-2). We see the 
avoided crossings near times at which the decoupled dynamics (1.11) 



would give a crossing of c/s (see also Fig|6J). The crossings are avoided 
with exp(— 1/Ch) width and a\ = ci2 at the crossings. These cases are 
not yet covered by our theory. Of the five crossings of a/s in the bottom 
figure, three do not involve crossings of c/s are hence the description 
by effective dynamics there is covered by our theorem. However, in the 
absence of avoided crossing of c/s the solitons can interact only once. 



satisfies symplectic orthogonality conditions: 



lu(v, d ai q) = lu(v, d a2 q) = 

u(v, d Cl q) = u(v, d C2 q) = 



EFFECTIVE DYNAMICS OF DOUBLE SOLITONS FOR PERTURBED MKDV 



9 



These can be arranged by the implicit function theorem thanks to the nondegeneracy 
of u\ M - This makes q the symplectic orthogonal projection of u onto the manifold of 
solitons M . 

Since u = q + v and u solves mKdV, we have 

(1.13) d t v = d x (£ c , a v - 6qv 2 - 2v 3 + bv) - F , 
where 

£ c ,a = ~d 2 x - 6q(x, a, cfv , 
and Fq results from the perturbation and dt landing on the parameters: 

2 2 

= - c])d a] q + ^ <VM ~ d x(bq) . 

j=l 3=1 

We decompose F = F\\ + F±, where F\\ is symplectic projection of F onto T q M, and 
F± is the symplectic projection onto its symplectic orthogonal (TqM)^. As seen in 
(5.4), F» = is equivalent to the equations of motion (1.4) (we assume in the proof 
that c 2 > C\ > 0). 

Using the properties of q, we show that Fj_ is 0(h 2 ). In fact it is important to 
obtain a specific form for the 0(h 2 ) term so that it is amenable to finding a certain 
correction term later - see £j6] 

The estimates for F\\ are obtained using the symplectic orthogonality properties of 
v. For example, = (v,d^ 1 d aj q) implies 

= d t (v, d^da-q) = ( dfV , d~ x d aj q) + (v, d t d~ x d a .q) , 

t 

substitute equation | |1.13| 

which can be used to show that 

(1.14) < Ch 2 \\v\\ H 2 + \\vf H 2, 



sec 



m 



The next step is to estimate v satisfying (1.13) with v(0) = 0(h 2 ) (in the theorem 
v(0) = 0, but we need this relaxed assumption for the bootstrap argument). We want 
to show that on a time interval of length h' 1 , that v at most doubles. The Lyapunov 
functional E(t) that we use to achieve this comes from the variational characterization 
of the double soliton (see [231 §2] and Lemma 4.1 below): if 

H c (u) = h(u) + (c? + c 2 2 )h(u) + c\clh{u) , 

then 

H' e (q(;a,c)) = 0, V a G R 2 , 

and 

H"(q(-,a,c)) = )C c , a , 
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where K, ca is a fourth order operator given in (4.11) below. Hence 



S(t) = H c(t) (q(., a(t), c{t)) + v(t)) - H c{t) (q(; a{t), c(t))) , 

satisfies 

£{t) « (K, c>a v,v) , 

and, as in Maddocks-Sachs [24J for KdV, /C Cja has a two dimensional kernel and one 
negative eigenvalue. However, the symplectic orthogonality conditions on v imply that 
we project far enough away from these eigenspaces and hence we have the coercivity 

s\\v\\h < m ■ 

To get the upper bound on £(t), we compute 

j t £(t) = 0(h)\\v(t)\\%* + {K. c , a v, F\ { ) + {K, c , a v, F ± ) , 



see £[9j Using (1.14) we can estimate the second term on the right-hand side but 
= 0(h 2 ) only. We improve this to h 3 using a correction term to v - see §[8j and 
the comment at the end of this section. 
All of this combined gives, on [0,T], 



v 



2 H 2 < \\v(0)\\ 2 H 2+T(\Fn\\\v\\ H 2 + h 2 \\v\\ H 2 + |H|| 2 ) 



'I 

IFiiI < Ch 2 \\v\\ H 2 + 112 



H 2 J 

which implies 

\Hm<h\ \F\\\<h\ on [0,/T 1 ]. 

Iterating the argument 5 log(l/ h) times gives a slightly weaker bound for longer times. 
The 0(h 4 ) errors in the ODEs can be removed without affecting the bound on v, 
proving the theorem. 

In the proofs various facts due to complete integrability (such as the miraculous 



Lemma 2.1) simplify the arguments, in particular in the above energy estimate. 

We conclude with the remark about the correction term added to v in order to 
improve the bound on from h 2 to h 3 . A similar correction term was used in [19] 
for NLS 1-solitons. Together with the symplectic projection interpretation, it was the 
key to sharpening the results in earlier works. Implementing the same idea in the 
setting of 2-solitons is more subtle. The 2-soliton is treated as if it were the sum of 
two decoupled 1-solitons, the corrections are introduced for each piece, and the result 
is that F± is corrected so that 

\\F±\\h» < h 3 + /iV 7|ai - a21 

That is, when \ai — d2\ = 0(1), there is no improvement. However, this happens only 
on an 0(1) time scale and hence does not spoil the long time estimate. 
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1.4. Numerical experiments. Unlike NLS, KdV is a very friendly equation from 
the numerical point of view and MATLAB is sufficient for producing good results. 

We first describe the simple codes on which our experiments are based. Instead of 
considering (1.1) on the line, we consider it on the circle identified with [— 7T, it). To 
solve it numerically we adapt the code given in [MJ Chapter 10] which is based on 
the Fast Fourier Transform in x, the method of integrating factor for the —u xxx i— > 
—ik 3 u(k) term, and the fourth-order Runge-Kutta formula for the resulting ODE in 
time. Unless the amplitude of the solution gets large (which results in large terms in 
the equation due to the u 3 term) it suffices to take 2 , N = 8, discretization points 
in x. 

For X G [— 7r,7r) we consider B(X,T) periodic in X, and compute U(X,T) satis- 
fying 

d T U = -d x (d 2 x U + 2U 3 - B(X, T)U) , Ufa T) = U(-tt, T) . 
A simple rescaling, 

u(x, t) = aU (ax, a 3 t) , b(x, t) = a 2 B(ax, a 3 t) , 

gives a solution of (ITT) on [— Tc/a,n/a] with periodic boundary conditions. When a 
is small this is a good approximation of the equation on the line. If we use U(X,T) 
in our numerical calculations with the initial data q2{X, A, C), A G IR 2 , C G M 2 \ C, 
the initial condition on for u(x, t) is given by 

u(x, 0) = q%(x, A/ a, aC) . 



If we want c = aC to satisfy the assumptions (1.2), the effective small constant h 
becomes h = a and b in (1.1) becomes 

b (x,t) = h 2 B(x,h 2 t) . 

In principle we have three scales: size of B, size of d x B, and size of dtB, which 
should correspond to three small parameters h. For simplicity we just use one scale 
h in the Theorem. 

Figure [T] shows four examples of evolution and comparison with effective dynamics 
computed using the MATLAB codes available at [IT]. The external potentials used are 
given by 

B(x, t) = 100 cos 2 (x - 10 3 t) - 50 sin(2x + 10 3 t) , 
B(x, t) = 100 cos 2 (x - 10 3 t) + 50 sin(2x + 10 3 t) , 

(1.15) 

B(x, t) = 60 cos 2 (x + 1 - 10 2 t) + 40 sin(2x + 2 + 10 2 t) , 
B(x, t) = 40 cos(2x + 3 - 10 2 t) + 30 sin(x + 1 + 10 2 t) . 

The rescaling the fixed size potential used in the theorem, bo(x,t) = h 2 B(x,h 2 t), 
means that our h satisfies h ~ 1/5 in the last two examples. In the first two examples 
the scales in x are different than the ones in t: the potential is not slowly varying 
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in t if h 1/10. The agreement with the main theorem is very good in all cases. 
However, the theorem in the current version does not apply to the two bottom figures 



since the condition in (1.12) is not satisfied for the full time of the experiment. See 
also Fig. [3] and Appendix [Bj 

We have not exploited numerical experiments in a fully systematic way but the 
following conclusions can be deduced: 

• For the case covered by our theorem the agreement with the numerical solution 
is remarkably close; the same thing is true for times longer than To/h, with 



T defined by (1.12) despite the crossings of C/s (resulting in the avoided 
crossing of Cj's) The agreement is weaker but the experiments involve only 
relatively large value of h. 

The soliton profile persists for long times but we see a deviation from the 
effective dynamics. This suggest the optimality of the bound log(l//i)//t in 



(1.3). 



The slow variation in t required in the theorem can probably be relaxed. 
For instance, in the top plots in Figjl] max \dtbo\/ max\d x bo\ ~ 10, while the 
agreement with the effective dynamics is excellent. For longer times it does 
break down as can be seen using the Bmovie .m code presented in [T71 §3]. An 
indication that slow variation in time might be removable also comes from [2] . 



• When the decoupled equations (1.11) predict crossing of Cj's, we observe an 
avoided crossing of c/s - see Fig|3]and FigjH] with exponentially small width, 
exp(— 1/Ch). At such times we also see the crossing of a/s, though it really 
corresponds to solitons changing their scale constants - see FigjTj To have 
multiple interactions of a pair of solitons, this type of crossing has to occur, 
and it needs to be investigated further. 

1.5. Acknowledgments. The authors gratefully acknowledge the following sources 
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DMS-0901582, CP's visit to Berkeley in November of 2008 was supported in part 
by the France-Berkeley Fund, and M.Z. was supported in part by the NSF grant 
DMS-0654436. 

2. Hamiltonian structure and conserved quantities 
The symplectic form, at first defined on «S(R; M.) is given by 

(2-1) u(u,v) d = f (u,d- l v), (f,g) = J fg, 

where 

d^ttx^U - )f(y)dy 
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Then the mKdV (equation (1.1) with b = 0) is the Hamiltonian flow d t u = d x H' (u) 



and (1.1) is the Hamiltonian flow d t u = d x H' b (u), where 



Ho=\ Jiul-u') H b = y(ul-u* + bu 2 ) 

Solutions to mKdV have infinitely many conserved integrals and the first four are 
given by 



Io(u) = J udx , 
iAu) = u 2 dx . 



hiu) = I (u 2 x — -u 4 ) dx , 



I 5 ( u ) = J (u 2 xx - 10u 2 x u 2 + 2u b ) dx , 

which are the mass, momentum, energy, and second energy, respectively. In this 
paper we will only use these particular conserved quantities. 

We write Ij(u) = J Aj(u), which means that Aj(u) denotes the j-th Hamiltonian 
density. 

For future reference, we record the expressions appearing in the Taylor expansions 
of these densities, 

(2.2) A 3 (q + v) = AM + A' 3 {q){v) + \A\q)(v, v) + 0(v') , 

A[(q)(v) = 2qv , 
A! z (q)(v) = 2q x v x -4q 3 v, 

A ' 5 (q)( v ) = 2 qxxV xx - 20q x q 2 v x - 2Qq 2 x qv + 12g 5 w , 

and 



.2 



A'((q)(v,v) = 2v 2 
A'^q)(v,v) = 2v 2 x -I2q 2 v 2 , 

Al(q)(v, v) = 2v 2 xx - 20q 2 v 2 x - 20q 2 x v 2 - 80qq x vv x + 60q 4 v 2 
The differentials, I'Aq), are identified with functions by writing: 

(I'(q),v}= [ A'(q)(v). 
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It is useful to record a formal expression for /j(g)'s valid when A,(g)'s are polynomials 
in diq: 



(2.3) 



>dMq) 



^=d l q. 



The Hessians, Ij(q), are the (self-adjoint) operators given by 



(I?{q)v,v) = J A';(q)(v,v). 

One way to generate the mKdV energies is as follows (see Olver 

A(u) = -d 2 x - 4u 2 - 4u x d^u , 
and recall that A(u)d x is skew-adjoint: 

A(u)d x = -dl - Au 2 d x - Au x d- l ud x 

= -dl - Au 2 d x - Au x u + ^u x d~ x u x , 

where we used the formal integration by parts d^iufx) = —d^iuxf) + uf. 
With this notation we have the fundamental recursive identity: 

(2.4) «-W(«)= A («)«- J «-i(«), 
which together with skew-adjointness of A(u)d x shows that 

(I'^.dJ'^u)) = (I'j-Mtdvl't^u)) , 



). Let us put 



for j and k odd (if we use (2.4) with m even the choice l2 m ( u ) = 0> f° r m > is 
consistent). By iteration this shows that 

(2.5) (i;(u),d x I' k (u)) = 0, Wj,k. 

In fact, since j and k are odd we can iterate all the way down to j = 1 and apply 
@: 

(A{uW k+j _ x {u)) = - (d x u^ , ^2 9A j+ k-i («)/ dv$) 

e>o 

d x (A j+k -i{u))dx = . 



If u solves mKdV, then dtu = ^d x I' 3 (u) and hence by (2.5) we obtain 

dtijiu) = (i;(u),d t u) = ^(i;(u),dj' 3 (u)) = o. 

The following identities related to the conservation laws will be needed in £j9j Re- 
calling the definition (2.2) of Aj, we have: 
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Lemma 2.1. For any function u G S, and for b G C7°° R S' , we have 
(I[(u),(bu) x ) = (b x ,A 1 (u)) 
(I' 3 (u),(bu) x ) = 3(b x ,A 3 (u)) - {b xxx ,At{u)) 
(I' 5 {u), (bu) x ) = 5(b x ,A 5 (u)) - 5(b xxx ,A 3 (u)) + (b xxxxx , A x {u)) 

Proof. By taking arbitrary b G S, we see that the claimed formulae are equivalent to 
ud x I[{u) = d x Ai{u) , 
ud x I' 3 (u) = 3d x A 3 (u) - d^A^u) , 
ud x I' 5 (u) = 5d x A 5 (u) - 5d 3 x A 3 (u) + dlA x {u) , 

and these can be checked by direct computation. 

Lemma 2.2. For any function u,q G S, and for b G C°° fl S' , we have 
(l'!(Q)v,(bq) x ) ~ (d x I[(q),bv) = (b x , A\(q){v)) 
(l£(q)v,(bq) x ) - (dj'MM = S(b x ,A' 3 (q)(v)) - (b xxx , A'^v)) 
(I'Mv, (bq) x ) - {d x l' b {q)M = 5(b x ,A' 5 (q)(v)) - 5(b xxx ,A' 3 (q)(v)) 

+ (b xxxxx , A'^q)^)) 



□ 



Proof. Differentiate the formulae in Lemma [2Tl] with respect to u at q in the direction 
of v. □ 



3. Double soliton profile and properties 

Here we record some properties of mKdV and its double soliton solutions. The 
parametrization of the family of double solitons follows the presentation for NLS in 
Faddeev-Takhtajan [12] , 

The double-soliton is defined in terms of the profile q(x, a, c), where 

fl=(oi,a 2 )eM 2 , c = (ci,c 2 )Gl 2 \C, 

(3- 1 ) At 

C = {(ci, c 2 ) : ci = ±c 2 } Ulx{0}U{0}xl. 

The profile q = q% (from now on we drop the subscript 2) is defined by 

detMi 



(3.2) 
where 



q(x, a, c) 



detM 



M = [Mi 



ijtt<ij<2 > 



and 



Mi. 



7i = 1 exp(- Cj (x - a-j)), j = 1,2. 



1 + lilj 

Ci ~\~ Cj 



Mi = 


M 


7i 




72 




1 1 
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For conveninece we will consider the 



< c\ < c 2 

connected component of M 2 \ C throughout the paper. Since 

q(x, ai, a 2 , c u c 2 ) = -g(x, a 2 , a 1} c 2 , c x ) , 

q(x, ai, a 2 , -ci, -c 2 ) = -q(-x, -ai, -a 2 , c 1: c 2 ) , 

the only other component to consider would be, say, < —C\ < c 2 (see Figj2|, and 
the analysis is similar. 

We should however mention that in numerical experiments it is more useful to 
introduce a phase parameter e = (ei,e 2 ), €j = ±1, and define q(x,a,c,e) by (3.2) but 
with 7j's replaced by 



% = ("I) 
We can then check that 



j'-i 



e i exp(-c j (x - aj)), j 



1,2. 



q(x, a, c, e) = q(x, a, (eiCi, e 2 c 2 )) 

but q seems more stable in numerical calculations. 
The corresponding double- soliton 



(3.3) 



u(x, t) = q(x, ai + c\t, a 2 + c\t, c l5 c 2 ) 



is an exact solution to mKdV. For the double soliton this can be checked by an explicit 
calculation but it is a consequence of the inverse scattering method. This is the only 
place in this paper where we appeal directly to the inverse scattering method. Fig. [4] 
illustrates some aspects of this evolution. 
The scaling properties of mKdV imply that 



(3.4) 



q(x + t, a + (t, t),c) = q(x, a, c) 
q(tx,ta,c/t) = q(x,a,c)/t . 



Both properties also follow from the formula for q, with the second one being slightly 
less obvious: 



q(tx, ta, c/t) 



det tM 



det 



tM 


7i 




72 


1 1 






^?det 

det tM 

q(x, a, c)/t . 



t 





" 




1 











t 





Mi 





1 











1 










l/t 
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Propagating double soliton 




-2.5 -2 -1.5 -1 -0.5 0.5 1 1.5 

The right component of the decomposition The left component of the decomposition 




Figure 4. A depiction of the double soliton solution given by (3.3) 



The top figure shows the evolution of a double soliton. The bottom two 



figures show the evolution of its two components defined using (3.11). 
One possible "particle-like" interpretation of the two soliton interaction 
jl] is that the slower soliton, shown in the left bottom plot is hit by 
the fast soliton shown in the right bottom plot. Just like billiard balls, 
the slower one picks up speed, and the fast one slows down. But unlike 
billiard balls, the solitons simply switch velocities. 

Now we discuss in more detail the properties of the profile q. Recalling that we 
suppose that c 2 > c\ > 0, let 



, c x dcf 1 , / Ci + C 2 \ dcf 1 , f c 2 - Ci 

(3.5) an = — log , a 2 = — log ■ 

ci Vc 2 -ci/ c 2 V c i + C 2 



noting that for c 2 > C\ > 0, ot\ > and a 2 < 0. Fix a smooth function, 9 G 
L7°°(M, [0,1]), such that 



(3.6) 6{s) 



1 for s < — 1 
— 1 for s > 1 . 



IS 
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Data at t = 



6 




Double soliton with a2=-a1=3,c1=3,c2=5 

- - - Single soliton with c=3 
Single soliton with c=5 




3 






2 


A 




1 

n 







■ Double soliton with a1=-a2=3,c1=3,c2=5 

■ Single soliton with c=5 

■ Single soliton with c=3 





Evolution of the above data at t = -0.75 



Evolution of the above data at t = 0.75 





Figure 5. The top plots show show q(x, 3, 5, =)=3, ±3), the correspond- 



ing rj(x } aj } Cj) given by Lemma 3.2 The bottom plots show the post- 
interaction pictures at times =f0.75. Since the sign of a 2 — at changes 
after the interaction we see the shift compared to the evotion of 
r)(x, dj, Cj)'s. 



Define the shifted positions as 

(3.7) 

that is, 



dj = f a,j + aj6(ci2 — cii) 



dj + atj , a 2 <C ai 



dj — a.j , a 2 3> ai . 
see Fig. |5j We note that dj = dj(aj, 01,02). 
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Let S denote the Schwartz space. We will next introduce function classes S so \ and 
iSerr; and then show that q G <S so i and give an approximate expression for q with error 
in iS err . 

Definition 3.1. Let S err denote the class of functions, ip = <p(x, a, c), x G R, a G IR 2 , 
< 5 < ci < c 2 — 5 < 1/5 (for any fixed 5 ) satisfying 

\did k c d p a y\ < C 2 exp(-(|a; - a x \ + \x - a 2 \)/C 1 ) , 

where Cj depend on 5, I, k, and p only. 

Let S so \ denote the class of functions of(x,a,c) of the form 



Pi(ci, c 2 )<fi(c 1 (x - ax)) + p 2 {ci, c 2 )<p 2 (c 2 (x - a 2 )) + tp(x,a,c) 



where 



(1) \di<p 3 (k)\ < Ciexp{-\k\/C), for some C, 

(2) p 3 G C°°(R 2 \C). 

(3) <p G <S err . 

Some elementary properties of S so \ and S err are given in the following. 

Lemma 3.1 (properties of <S er r)- 

(1) 9 x S crr G S err , a ^S eTT c <S eTr , d C jS err c S err . 

(2) (x — (Z^)(S err G <^err and (yX — CLj^S crv G ^err • 

(3) If f e S e „ and f^™ f = 0, then d~ l f G S crr . 

The class <S err allows to formulate the following 

Lemma 3.2 (asymptotics for q). Suppose that < Ci < c 2 < C\/e < 1/e 2 , for e > 0. 
Then for \a 2 — a x \ > C /(ci + c 2 ), 



(3.1 



did k c dZ(q(x,a,c)-J2v( 

3=1 



Ob . j 5 Cj j 



< C 2 exp(-(|x - ai| + \x — a 2 \)/Ci) 



where C 2 depends on k,£,p and e, and C$, C\ on e only. In other words, 

2 

q(x,a,c) -^2rj(x, 

% i C 3 ) ^ '-'err • 

3=1 



Corollary 3.3. d x l d a q, d x 1 d c q G <S. 



sol ■ 



Proof. By Lemma 3.2, we have 



= d ej^2v(; & j, c j) + f 

3=1 
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where / G iS err . By direct computation with the 77 terms, we find that 

"+00 2 



/+00 
i=i 



By the remark in Lemma 



3.5 



3.1 3), we have d x 1 f G <S err . Hence 



we have f + °° d r .q = 0. Hence f + °° f = 0. By Lemma 



d x 1 d Cj q = d x 1 d Cj ^ r](-, a is c,) + S cn 
3=1 

and the right side is clearly in «S so i. 
Proof of Lemma 3.S\ We define 

(3.9) Q(x, a, 8) = f q(x, —a, a, 1 — 8, 1 + 8) 



□ 



so that, using (|3.4|), 
(3.10) 



q(x,a 1 ,a 2 ,c 1 ,c 2 ) 



Ci+C 2 q ( ( c x + c 2 



X 



a 



Ci + c 2 \ j a 2 — ai 



8 



ax + a 2 
2 

c 2 - Ci 
C 2 + Ci ' 



a, <5 



Hence it is enough to study the more symmetric expression (3.9). We decompose it 
in the same spirit as the decomposition of double solitons for KdV was performed in 

Q(x, a, 8) = t(x, a, 8) + r(—x, —a, 8) , 

1 (1 + 8) exp((l -6)(x + a)) + (1 - 8) exp((l +6){x- a)) 

2 8sech 2 (x — 8a) + 8' 1 cosh 2 (5x — a) 

This follows from a straightforward but tedious calculation which we omit. 
Thus, to show (3.8) we have to show that 

(3.13) 



(3.11) 
where 

(3.12) r(x,a,8) 



&d$(r(x, a, 8) - V (x - \a\ - log(l/5)/(l ± 8), 1 ± 8))\ 
< C 2 exp(-(|x| + |a|)/Ci) , ±a » 1 , 



uniformly for < 5 < 1 — e. 

To see this put 7 = (1 — 8)/(l + 8), and multiply the numerator and denominator 
of ((3121) by e^ 5 ^-^: 



(3.14) r(x,a,8) 



2(1-8) (l + 7 - 1 e 2 «- 2fe ) 



fi e (l-8)(x+a)n _ e -2x+28ay _|_ §-l e -(l-S)(x+a) M _|_ e -2Sx+2a\2 ' 
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Similarly, the multiplication by e ~( 1 + s )( x - a ) 
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(3.15) 

t(x, a, 6) 



gives 

2(1 + 5) (1 + ie - 2a+2Sx ) 



§ e (l+6){x-a) (l _ e -2x+2Sa\2 + fi-l e -(l+6)(x-a) n _|_ e ~26x-2a\2 

2(1 + 5) (1 + ie - 2a+26x ) (1 + e" 



§ e (l+5)(x-a) ((+ _ e -2x+25a^ + e ~25x-2a^ + $-l e -(l+S)(x-a) " 

This shows that for negative values of x, r is negligible: multiplying the numerator 



and denominator by 5 and using (3.14) for a < and (3.15) for a > 0, gives 
(3.16) 



t(x, a, 8) < 



S(i + 8)(1 + e -2([aM*[)) e -(i+*)(M+|a|) } a > o . 

5(1 + S)(1 + e 2*l*|-2[a[)-l e -(l-«)(M+[«|) ) a < o 



and in fact this is valid uniformly for < S < 1. Similar estimates hold also for 
derivatives. 



For x > 0, < 5 < 1 — e, and for a -C — 1, we use (|3.14|) to obtain 

1 



r(x, a, 5) = (1 — 5) sech ( (1 — 5) I x — \ct\ — 



1 



and for a 3> 1, (3.15): 



r(x, a, 5) = (1 + 8) sech ( (1 + 8) ( x — |a 



l-5 log 5 



1 i 1 

log 



where 



1 + 8 ° 8 



\d%e±\ < C fc exp(-(|x| + \a\)/c) , c>0 



+ e_(x, a, 5) 



+ e + (x, a, 8) 



uniformly in 8, < 8 < 1 — e. Inserting the resulting decomposition into (3.10) 
completes the proof. □ 

Lemma 3.4 (fundamental identities for q). With q = q(-,a, c), we have 



(3.17) 



d x I' 3 (q) = 2d x (-d 2 x q - 2g 3 ) = 2 ^ c)d a ,q . 

i=i 



(3.18) 



9,^(g) = 29 1 .g = -2^9 aj g. 



(3.19) 



3=1 3=1 



These three identities are analogues of the following three identities for the single- 
soliton rj = rj(-, a, c), which are fairly easily verified by direct inspection. 

d x I[(rj) = d x r] = -d a r] 
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dJ'M = d x (-d 2 xV - 2 V 3 ) = c 2 d a r] 
r] = (x — a)d a rj + cd c i] 



Proof. The first identity is just the statement that (3.3) solves mKdV and we take it 



on faith from the inverse scattering method (or verify it by a computation). To see 
(3.18) and ( 3.19[ ) we differentiate (3.4) with respect to t. □ 



The value of Ij(q) for all j is recorded in the next lemma. 
Lemma 3.5 (values of Ij{q))- 
(3.20) I Q {q) = 2tt 

For j = 1,3,5, we have 

j-i c[ + cj 



(3.21) 
Also, 
(3.22) 



IM) = 2(-l) 



j xq(x, a, c) 2 dx = 2a\C\ + 2a2C2 . 



Note that by (3.20), 



d ai q = 0, 



cLg = 0, j = l,2. 



from which it follows that d x 1 {d a q) and d x 1 {d c .q) are Schwartz class functions. 



Proof. We prove (3.21 ), (3.20) by reduction to the 1-soliton case. Let u(t) = q(-, a\ + 
tc(, a2 + tc 2 ,, ci, C2). Then by the asymptotics in Lemma 3.2 

2 

k=l 

where 

Mt)\ < (c 2 {{a 1 + tcl)-{a 2 + tcl)))- 2 
But note that by scaling, 

Ijivi', K + cltYi c fc)) = 4 7 i(^) 
By sending t —> +00, we find that 

To compute Ij(rj), we let r} c (x) = crj(cx). By scaling Ij(r) c ) = c^Ij{rj). Hence 
jljirj) = dcl^Jjfa) = (/;(??), <9 C j c=1 r/ c ) 

= (ijfa), (x V ) x ) = 2(-1)^<ij, (x V ) x ) = 2(-l)^ , 
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where we have used the identity 

(3.23) I' J ( V ) = 2(-1)^ V , 

which follows from the energy hierarchy. In fact, I[(r)) = 2i] is just the definition of 
I[. Assuming that I'Ar]) = 2(— l)~2~r/, we compute 

dj' j+2 (rj) = Hv)d x I'M 

= 2(-l)(-l)^(d 2 x + V + ^ x d~\) nx 
= 2{-\)^d x {j) xx + 2r?) 
= 2(-l)^d xV 



xu 



We now prove (3.22). By direct computation, if u(t) solves mKdV, then d t J 
-3I 3 (u). Again let u(t) = q(-, a x + tc 2 , a 2 + tc 2 ,, c\, c 2 ). By (3.21 ) with j = 3, we have 



xq(x, a, c) 2 dx = / xu(0,x) 2 dx = / xu(t, x) 2 dx — 2{c\ + c 2 )t 



By the asymptotics in Lemma 3.2 



xu(t, x) 



2 r 

/ xr l( 
i=i J 



X, (Cbj + tC 2 )\ Cj) 2 + U)(t) 



where 



But 



\u(t)\ < (oi + tcl)(c 2 ({ai + eft) - (a 2 + tc 2 2 ))Y 



I 



Combining, and using that ciai + C2S2 = Ciai + 0202, we obtain 

a;g(x, a, c) 2 dx = 2(ciai + C2CI2) + 
Send t —7- +00 to obtain the result. 

We define the four-dimensional manifold of 2-solitons M as 

M = {q(-,a,c)\a= (ai,a 2 ) ER 2 ,c= (a, c 2 ) G (M) 2 \ C } 



□ 



Lemma 3.6. The symplectic form (2.1) restricted to the manifold of2-olitons is given 
by 

2 

oj\m = dctj A dej . 
3=1 

In particular, it is nondegenerate and M is a symplectic manifold. 
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Proof. By (|3_21| with j = 1 and fl3.18| ), 

= ^daj^q) = -(I[(q),d ai q) = {d ai q,d~ l d ai q) + {d^d^d^q) 
= (9a 2 Q,d~ 1 d ai q) 
Again by fl3~2~Ij ) with j = 1 and flO?) , 

(3.24) 1 = l -d c JM = \(l'M,d Cl q) = {d ax q,dZ X d Cl q) + (d a2 q,d^d Cl q) 



By (|3_21j) with j = 3 and (|3_17j), 

(3.25) -c 2 = \d c jM = \{i'MM = -clida^d-^q) -cl(d aA d- l d cl q) 



Solving (3.24) and (3.25), we obtain that {d^q.d^d^q) = 1 and {d a2 q,d~ l d Cl q) = 0. 
We similarly obtain that (d a2 q,d~ 1 d C2 q) = 1 and (d^q^d^d^q) = 0. It remains to 
show that (<9 Cl g, c^c^g) = 0: 



1 2 

{d Cl q, d x l d C2 q) = — (V c,d g, 1 d C2 q) 
ci z — ^ 

1 2 

= — (<? - ^,{x - aj)d aj q, d x l d C2 q) 



by (3.19) 



3=1 



1 1 2 

— (g + xq x , d~ l d C2 q) + — V* g, d^d^q) by ( |3.18| ) 
Ci ci -f-f 1 r 



-a C2 / xq + 



2 , ^2 
Cl 



by (3.22) 



□ 



Remark. If |ai — aa| ^> 2, and c\ < C2 then, in the notation of (3.7), 

3=1,2 j=X,2 

that is the map (a, c) h-> (a, c) is symplectic. 



The nondegeneracy of the symplectic form (2.1) restricted to the manifold of 2- 
olitons, M shows that if 2 functions close to M can be uniquely decomposed into 
an element q, of M and a function symplectically orthogonal T q M. We recall this 
standard fact in the following 

Lemma 3.7 (Symplectic orthogonal decomposition). Given c, there exist constants 
5 > 0, C > such that the following holds. If u = q(-,d,c) +v with ||5||H a — then 
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there exist unique a, c such that 
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c\ < cm 



H 2 



\a — a\ < C\\v\\h2 , 
and v d = u — a, c) satisfies 
(3.26) (v,d~ l d ai q) = and {v,d- l d c .q) = 0, j = 1,2. 



Proof. Let : H 2 x 



x 



) 2 — > M 4 be defined by 



<p(u, a, c) 



(u - q(- 1 a,c),d~ 1 d ai q) 
(u - q(-,a,c),d- 1 d a2 q) 
(u - q(-,a,c),d- 1 d Cl q) 
(u- q(-,a,c),d~ 1 d C2 q}_ 



Using that u;| = da\ A dc\ + <ia2 A c?C2, we compute the Jacobian matrix of cp with 



respect to (a,c) at (q(-, a, c), a, c) to be 

D a ,Mq(-,a,c),d,c) 



'0 10' 

1 
10 
10 



By the implicit function theorem, the equation (p(u, a, c) = can be solved for (a, c) 
in terms of u in a neighbourhood of q(-, a,c). □ 

We also record the following lemma which will be useful in the next section: 

Lemma 3.8. Suppose v solves a linearized equation 



d t v = -d x I'z{q{t))v = d x {-dl - 6q(t) 2 )v , q(x, t) = q(x, aj + tc? Cj) . 



Then 



d t (v(t),d^(d Cj q)(t)) = d t (v(t),d-\d aj q)(t)) = 0, 

where (d Cj q)(t) = (d C] q)(x,aj + tc 2 ,Cj) (and not d Cj (q(x,aj + tc 2 ,Cj))). In addition, 
forv(0) = d aj q, v(t) = (d aj q)(t), and for v(0) = d Cj q, 

v(t) = (d Cj q)(t)+2 Cj t(d aj q)(t). 

4. Lyapunov functional and COERCIVITY 

In this section we introduce the function H c adapted from the KdV theory of 
Maddocks- Sachs [23]. We will build our Lyapunov functional £ from H c . 
Thus let 

H c (u) ^ h{u) + (cl + cj)I 3 (u) + c\c\h{u) . 
We give a direct proof that q(-, a, c) is a critical point of H c : 
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Lemma 4.1 (q is a critical point of H). We have 
(4.1) H'M-,a,c)) = 0, 

that is 

I'M + (ci + cl)I' 3 (q) + cicll[(q) = 0. 
Proof. We follow Lax [231 §2]: we want to find A = A(q) and B = B(q) such that 

H>(q)^r,(q) + AI' 3 (q) + BI[(q) = 0, 



for all q = q(x,a,c) G M. If we consider the mKdV evolution of q given by (3.3), 
then Lemma |3~2 shows that as t — > ±oo we can express H'(q) asymptotically using 
H'(r) Cl ) and H'(j] C2 ). From (3.23) we see that 

H'( Vc ) = I' 5 ( Vc ) + AI' 3 ( Vc ) + BI[( Vc ) = 2(c 4 - Ac 2 + B) Vc . 

Two parameters c\ and C2 are roots of this equation if A = c\ + c\ and B = cfc 2 , and 
this choice gives 



(4.2) 



H'(q(t)) = r(t) , \\r(t)\\ L 2 < Cexp(-\t\/C) 



q{t) = f q(x, ax + eft, a 2 + c^t, ci,c 2 ) 



where the exponential decay of r(t) comes from Lemma 3.2 and the fact that c\ ^ c 2 . 

To prove (4.1) we need to show that r(0) = 0. For the reader's convenience we 
provide a direct proof of this widely accepted fact. Since it suffices to prove that 
(r(0),w) = 0, for all w G S, we consider the mKdV linearized equation at q(t), 



1 



d x I'Mt))v, v{0) = weS, 



(4.3) 

and will show that 

(4.4) d t (r(t),v(t))=d t (H'(q(t)),v(t)) = 0. 
The conclusion (r(0), w) — will the follow from showing that 

(4.5) (r(t),v(t)) -> 0, t->oo. 
We first claim that 

d t (I , k (q),v) = 0, Vk. 



In fact, from (2.5) we have (I' k ((p) , d x I' 3 ((p)) = for all <p G S. Differentiating with 
respect to cp in the direction of v, we obtain 
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Applying this with v = v(t) and tp = q(t) we conclde that 

dt(I'M,v) = (I'l{q)d t q,v) + \{l' k [qWl{q)v) 

= ^i*(?)^(?),«> + ^ii(g),^(?)t;>, 
= o. 



Since if is a linear combination of J^'s, k = 1,3,5, this gives (4.4). 

We now want to use the exponential decay of ||r(t)|| £ 2 in (4.2), and (4.4) to show 
( 4.5[ ). Clearly, all we need is a subexponential estimate on v(t), that is 

(4.6) Ve>0 3t , \\v(t)\\ L 2 < e et , t > t . 

Let ip be a smooth function such that ip(x) = 1 for all |x| < 1 and ip(x) ~ e~ 2 ' x ' 
for |x| > 1. With the notation of Lemma [3~2 define 

tpj(x,t) — ip{8{x — (a, + Cjty~)). 

for < 5 1 to be selected below and j = 1,2. We now establish that 

2 



(4.7) 



9t \\v 



|2 

\L 2 



V 



x\\L 2 



+ 61 q 2 v 2 



< 



E 



To prove (4.7), apply d x 1 to (4.3) and pair with vt to obtain 

= {d~ l v t ,v t ) + (v xx ,v t ) + (6q 2 v,v t ) 



which implies 
(4.8) 



t 



1 



J x\\L 2 



2 2 

q v 



6 / qq t v" 



Next, pair (4.3) with v to obtain 

= (v t ,v) + (v xxx ,v) + 6(d x (q 2 v),v) 



which implies 
(4.9) 



dtlMlia = -12 J qq x v 2 



Summing (4.8) and (4.9) gives (4.7). 

The inequality (4.7) shows that we need to control is \\ipjV (t)\\, j = 1,2. For t large 
ijjj provides a localization to the region where q decomposes into an approximate sum 
of decoupled solitons (see Lemma 3.2). Hence we define 

Cj = c 2 — d 2 — 6i] 2 (x, (a,j + tc 2 y, Cj) 

(see also §[8] below for a use of similar operators). A calculation shows that 

(4.10) t>T(5) t (C^v,ij j v} = O(5)\\v\\ 2 Hl} 



2cS 
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where T(6) is large enough to ensure that the supports of ^'s are separated. It 
suffices to assume that v (0) = w satisfies (w, d^d^q) = and (w, d^d^q) = 0, since 
Lemma 3.8 already showed that the evolutions of d aj q and d Cj q are linearly bounded 
in t. Under this assumption, we have by Lemma 3.8 that (v(t),d~ l d a .q(t)) = and 
(v(t),d~%q(t)) = 0. 

We now want to invoke the well known coercivity estimates for operators Cj - see 
for instance [18"| §4] for a self contained presentation. For that we need to check that 



| (ipjv, d x l d a i](a j + tc 2 , cj)) | < 1 , | (ipjV, d x l (d c r)(a j + tc*, c 3/ 



< 1 



This follows from the fact that v is symplectically orthogonal to (d c .q)(t) and d aj q(t) 
(Lemma 3.8 again), the fact that q decouples into two solitons for t large, and from 



the remark after the proof of Lemma [376 
Hence, 

(CjipjV^jv) > \\ipjv\\ 2 H i . 

We now sum ( 4.7[ ) and (4.10) multiplied by to obtain, for t sufficiently large 
(depending on 5), 



F(t) = \\v 



F'(t) < C5^F(t) , 
6 / q 2 (t)v(t) 2 + d'Hc^it^^^Mt)) 



(where we added the additional J q 2 v 2 term to the right hand side at no cost). Con- 
sequently, F{t) < exp(C'5h), for t > T\(5). 

We recall that this implies (4.6) and going back to (4.4) show that r(0) = 0, and 
hence H\q) = 0. 



□ 



We denote the Hessian of H c at q(», a, c) by /C Cj(J : 

K, c , a = r^q) + {cl + c 2 2 )r>{q) + c 2 Al"{q) 
It is a fourth order self-adjoint operator on L 2 (IR) and a calculation shows that 



(4.11) 



+ Wd x q z d x + 10(-g 2 + (q% x + 3g 4 ) - Q(cf + c\)q 2 



Lemma 4.2 (mapping properties of K,). The kernel of K c ^ a in L 2 (M) is spanned by 
d a A: 



(4.12) 

and 

(4.13) 



K c>a d c .q = i(-iy Cj (ci - 4)d- L d a .q 
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Proof. Equations (4.12) follow from differentiation of (4.1) with respect to cij. As 
x — > oo, the leading part of /C Cja is given by (— d 2 + c\)(— d% + c%) and hence the kernel 
in L 2 is at most two dimensional. 
To see (4.13) recall that 

I' 1 (q) = 2q = -2d- 1 (d ai q + d a2 q) 

I'M = -2g" - 4g 3 = 2d~ 1 (c 2 1 d ai q + c\d a2 q) , 
where we used Lemma 3.4. By differentiating H'(q) = I' 5 (q) + (c\ + c\)I' 3 (q) + 
c i c 2-^i(<?) = with respect to Cj, we obtain 

(4.14) JC(d Cl q) = -2 Cl (l' 3 (q) + c 2 2 I[(q)) , K{d C2 q) = -2c 2 (l' 3 (q) + cll[(q)) . 



Inserting the above formulae for I[(q) and I' 2 (q) gives (4.13). □ 
The main result of this section is the following coercivity result: 

Proposition 4.3 (coercivity of /C). There exists S = 5(c) > such that for allv e H 2 
satisfying the symplectic orthogonality conditions 

(v, d^da.q) = and (v, d^d^q) = , j = 1, 2 , 

we have 

(4.15) 5\\v\\ 2 H2 < ()C c>a v,v}. 



The proposition is proved in a few steps. In Lemma 4^ we already described the 
kernel 1C CA and now we investigate the negative eigenvalues: 

Proposition 4.4 (Spectrum of /C). The operator /C C)(1 has a single negative eigen- 
value, h G L 2 (1R): 

(4.16) K, c ^ a h = —fih, ji > . 
In addition, for 

0<5<ci<c 2 -<5<1/<5, 
there exists a constant, p, depending only on 5, such that 

(4.17) min{A > : A G a(JC Cia )} > p , aeM 2 , 

Proof. As always we assume < c\ < c 2 . We know the continuous spectrum of /C Cia , 

0"ac(£c,a) = [2CiC2, +Oo) 

and that for all a, c, there is a two-dimensional kernel given by span{<9 ai g, d a2 q}. The 
eigenvalues depend continuously on a, c, and hence the constant dimension of the 
kernel shows that the number of negative eigenvalues is constant (since the creation 
or annihilation of a negative eigenvalue would increase the dimension of ker /C c>a .) 

Hence it suffices to determine the number of negative eigenvalues of K. for any 
convenient values of a, c. To do that we use the following fact: 
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Lemma 4.5 (Maddocks- Sachs [2U Lemma 2.2]). Suppose that K is a self- adjoint, 
4th order operator of the form 

K = 2{-d 2 x + c 2 )(-«9 2 + c 2 ) + p (x) - d xPl (x)d x , 

where the coefficients pj(x) are smooth, real, and rapidly decaying as x — >■ ±oo. Let 
r\(x), r 2 (x) be two linearly independent solutions of JCrj = such that rj — > as 
x — > — oo. 

Then the number of negative eigenvalues of /C is equal to 

ri(x) r[(x) 
T2(x) r' 2 (x) 



(4.18) 



dim ker 



We apply this lemma with /C = /C Cja , in which case 

Pl = 20q 2 , p = 40q xx q + 20q 2 x + 60g 4 - \2{c\ + c\)q 2 



q = q(;a,c) . 



Convenient values of a and c are provided by ai = a 2 = and c\ = 0.5, c 2 = 1.5. In 
the notation of (3.9) we then have q(x,a,c) = Q(x, 0,0.5), and since 

d x Q = -d ai q - d a2 q , d a Q = -d ai q + d a2 q , 



we can take r x = d x Q and r 2 = d a Q. A computation based on (3.11) and (3.12) 
shows that 



Q(x, 0.5, 0) = sech(x/2) , d x Q(x, 0.5, 0) 



sinh(x/2) 
2cosh 2 (a;/2) 



(4.19) 



r, ^, s sinh(x/2) , , ,, , ,. 

d a Q{x, 0.5,0) = \ / r9-2cosh 2 (a;/2)) 



4cosh 4 (x/2) 
9sinh(x/2) 



+ d x Q(x, 0.5,0) 



4cosh 4 (x/2) 

Since i4i/ = sinh(a;/2) is invertible, we only need to check the dimension of the 
kernel the Wronskian matrix of 

y ~ / \ y 



n(?/) 



i + y 2 



[1 + 2/ 



2^2 



and that is equal to 1 at y = and on K\ {0}. In view of (4.18) this completes the 
proof of (4.16) 



To prove (4.17) we first note that by rescaling (3.10) we only need to prove the 
estimate for 



K(c, a) = f K,( 



'((ci),(-a,a)) » ce[5, 1-5], 0<5<l/2. 
For that we introduce another operator 

(4 20) P(c) — f / q2 1 lW a2 1 " 2n 1 inQ ~ 2q 1 in '°- 2 °~ 4a 1 ~ 2n - 2 

where 



;-9 2 + l)(-<9 2 + c 2 ) + lOc^ 2 ^ + 10(3rf - 2r/ 4 ) - 6(1 + c 2 )rf 
77 = sechx , c G M+ \ {1} . 
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The operator P(c) is the Hessian of H( c ,i) & t ?7, which is also a critical point for Hr c ,i)- 
In particular, 

P(c)d x r] = . 

Putting, 



and 



UJ{x) = f(x + a + log((l + c)/(l - c)))) 



p + (c,«) d =f/:p( C )f/ Q 



we see that 

if (c, a) = 2P + (c, a) + C( e - (Q+|:E|)/C )^ + 0(e~ {a+lxl)/c ) , x > . 
Similarly, if 

TJ(x) = y/cf(cx) , 

and 

P_(c,a) d ^ c 2 U a T c P(l/c)T;U*, 

then 

tf(c, a) = 2P_ (c, a) + 0( e - (a+|a:|)/c )^ + e>( e - {a+N)/c ) , x < . 



We reduce the estimate (4.17) to a spectral fact about the operators P(c) and 
P(l/c): 

Lemma 4.6. Suppose that there exists 

a i — ► X(c,a) 6l \ {0} 

sitc/i t/iat 

A(c, a) e a(K(c, a)) , A(c, at) — >■ , a — > oo . 

Then we have 

(4.21) dimker i2 P(c) + dimker L 2 P(l/c) > 2 , 
where ker^2 means the kernel in L 2 . 

Proof. The assumption that ^ A(c, a) — > as a — > oo implies that there exists a 
family of quasimodes / a , ||/ a |U 2 — 1> 

(4.22) ||K(c,«)/ a || i2 = o(l), a^oo, / a ± ker i2 #(c, a) . 

Since we know that the kernel of K(c, a) is spanned by U*d x r] + 0( y e-^ +a ^ c ) and 

we can modify f a and replace the orthogonality condition 

by 

f a I span (U*d x r), U a T c d x rj) . 



The estimate in (4.22), and ||/a||L 2 = 0(1), imply that 
(4.23) \\f a \\ H2 = 0(l), a^oo. 
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We first claim that 

12, 



(4.24) 

In fact, on [—a/2, a/2], 



\f a (x)\ dx = o(l) , a — > oo 



-i 



K(c,a) = {-dl + c 2 ){-dl + 1) + 0{e-«' c )dl + 0(e 



a/c\ 



and hence, using (4.23), 

(-<9 2 + c 2 )(-<9 2 + 1)/ Q = r a , ||r Q || L 2 ([ _ Q/ 2, a /2]) = o(l) . 

Putting 

e a = [(-dl + c 2 )(-<9 2 + (r a l[_ a / 2 ,a/2]) , = o(l) 

we see that f a = g a + e a where 

(4.25) (-d 2 + c 2 )(-«9 2 + l)£ a (:r) = 0, |x|<a/2. 



Suppose now that (4.24) were not valid. Then the same would be true for g a , and there 
would exist a constant c > 0, and a sequence aj — > oo, for which \\g a . \\l 2 (\-i,i]) > Co- 
in view of (4.25) this implies that 

9«A X ) = E ( a t e±X + b t e±CX ) > H < «/ 2 > \4l \ b t\ = > 



and for at least one choice of sign, 

\af\ 2 + \bf\ 2 > Cl >0. 

We can choose a subsequence so that this is true for a fixed sign, say, +, for all j. In 
that simple calculation shows that for Mj — > oo, Mj < a.,/2, 



A-L 



1 1 1 |2„2M,- , 1 |7 + i2 2cAf,- 



\g aj (x)\ 2 dx>^\4\ 2 e^ + -\bl\ 2 e 



> 



1-c 

1 (l-c 

2 ViT^ 

4 



2c 1 
ot||6j|e 

2 



c+1 



a+||6+|e( c+1 ^' 



(l-c)Mj _ 



0{l) 



|2 e 2A ^ + -|6+|V 
c J 



1 



| a +|2 e2 (l-c)M, 



(1 — C) 2 ' J 

where we used the fact that 0<5<c<l — 5. Hence 



o(i), 



AI, 



\\f aj \\L*> / \UAx)\ 2 dx> / \g a .(x)\ 2 dx - o(l) 



AL 



> 



1 (1-C 



2 Vl + c 



Cie * M ' c - 0{l) — >• oo, j — > oo 



Since ||/ a |U 2 — 1 we obtain a contradiction proving (4.24). 
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Now let x±C°°(M) be supported in ±[-1, oo), and satisfy X+ + X- = 1- Then ( |4.24[ ) 
(and the corresponding estimates for derivatives obtained from ( 4.22[ )) shows that 

\\P±{c,oi){x±f a )\W = > « — 

For at least one of the signs we must have ||x±./a||L 2 > 1/3 (if a is large enough), 
and hence we obtain a quasimode for P±(c,a), orthogonal to the known element of 
the kernel of P±{c, a). This means that P±{c, a), for at least one of the signs has an 
additional eigenvalue approaching as a — > oo. Since the spectrum of P±(c,a) is 
independent of a it follows that for at least one sign the kernel is two dimensional. 
This proves (4.21). □ 

The next lemma shows that ( |4.21 ) is impossible: 

Lemma 4.7. For c eR + \ {1} 

(4.26) ker L 2 P(c) = C • d x rj . 

Proof. Let C = (/^(r?) + I'i{r]))/2: 

Cv = —v xx — 6r] 2 v + v , i](x) = sech(x) . 



We recall (see the comment after (4.20)) that 
Pic) 



\h'U{v) = \ (#fo) + (i + c 2 )HM + c 2 r;( v )) 



We already noted that 



C(d x rj) = P(c)d x r] = , 



and proceeding as in (4.14) we also have 

(4.27) C(d x (x V )) = -2 V , P(c)(d x (x V )) = 2(1 - c 2 ) V . 
We claim that 

(4.28) P(c)d x C = Cd x P(c) 



Since Ij(rj + tv) = tl"{rf)v + 0(t ), v G S, the equation (2.5) implies that 

( I j( r ]) v , d x l'k(rj)v} = , Vj,fc, veS. 

From this we see that 

(P(c)v, d x Cv) = , Vt; E S , 

and hence by polarization, 

(P(c)v,d x Cw) = -(P(c)w,d x Cv) = (d x P(c)w,Cv) . 



which implies (4.28). 

Suppose now that dimker^ P{c) = 2 for some c ^ 1, and let rj x and ip be the basis 
of this kernel. Since P{c) is symmetric with respect to the reflection x \- > —x, ip can 
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be chosen to be either even or odd. Applying (4.28) to ip we get P(c)d x Cip = and 
hence 

d x Cip = otr) x + Pip , 

for some a, /3 G R. 

If ip is odd then d x £ip is even, and therefore a — /3 — 0. But then ip G ker^2 C = 
C ■ rj x , giving a contradiction. 

If t/> is even then d x £ip is odd, /3 = and £0 = a?]. We have a ^ since ip is 
orthogonal to the kernel of C, spanned by d x rj. From (4.27) we obtain 

i/> = -^d x {xr]) . 



Applying the second equation in (4.27) we then obtain 

P(c)ip = -a(l - c 2 )r], 

contradicting ip G ker L 2 P(c). 



With this lemma we complete the proof of Proposition 4.4 



□ 
□ 



To obtain the coercivity statement in Proposition 4J3 we first obtain coercivity 
under a different orthogonality condition: 

Lemma 4.8. There exists a constant p > depending only on c\, C2, such that the 
following holds: If {u,d~ l d ai q) = 0, {u,d~ l d a2 q) = 0, (u,d ai q) = 0, (u,d a2 q) = 0, 
then ()C Cta u,u) > p\\u\\ 2 L2 . 



Proof. To simplify notation we put K = JC c>a in the proof. Using (4.13) and the 
expression for the symplectic form, lo\ = da 1 A dc\ + da 2 A dc 2 , we have 



(K.d ci q, d ci q) = -4ci(c? - c 2 2 )(d x 1 d ai q, d Cl q) = 4ci(c? - c; 



and similarly 
(4.29) 



()Cd C2 q, d C2 q) = -4c 2 (ci - c\) 



Since we assumed that c\ < c 2 , ()Cd Cl q,d cl q) < 0. 

Let d Cl q be the orthogonal projection of d ci q on (ker/C)" 1 . We first claim that there 
exists a constant a such that u = u + ad Cl q with (u, h) = 0, where \i and h are defined 



in Proposition 4.4 



To prove this, decompose d ci q as d ci q = £ + f3h with (£, ft,) = 0. Then by (4.29) 

> (}Cd Cl q,d Cl q) 
= (K4,£) + 2p(Kh,£) + p 2 (lCh,h) 
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Since (/C£, £) > 0, we must have that (5 ^ 0. Hence there exists v! and a such that 
m = u' + cid Cl q with («', /i) = 0. Now take u to be the projection of v! away from the 
kernel of /C. This completes the proof of the claim. 
We have that 

(u,JCd Cl q) = -4ci(c2 -c\)(u,d~ l d ai q) = 



by (4.13) and hypothesis. Substituting u = u + ad Cl q, we obtain 

(4.30) (u,)Cd Cl q) = -a(d Cl q,K.d Cl q) = -a(d Cl q,JCd Cl q) 

Now let p denote the bottom of the positive spectrum of /C. We have 

(/Cm, u) = (JC(u + ad Cl q), (it + ad Cl q)) 

= (/Cm, u) + 2a (/Cm, d ci q) + a 2 (JCd Cl q, d ci q) 



= (/Cm, u) - a 2 (ICd Cl q, d Cl q) by p3o| ) 

> p\\u\\ 2 L2 + 4 Cl (c 2 2 - cl)a 2 

>C(\\u\\ 2 L2 +a 2 ) 

where C depends on ci, c 2 and p. However, since u = u + ad Cl q, we have 

hll! 2 <^(ll^ll| 2 + « 2 ) 

where C depends on ci, c 2 which completes the proof. □ 
We now put 

E = E atC = ker K = span{<9 ai g, d a2 q] , 
(4.31) F = F a , c = spanj^ 1 ^^, d^d^q) , 

G = G atC = span{^' 1 9 01 g, d^d^q} . 
In this notation Lemma 14.81 states that 

u±(E + G) =^ (Ku,u) > 9\\u\\l 2 , 
while to establish Proposition |4.3| we need 

m _L (F + G) => (/Cm,m) > e\\u\\ 2 L2 . 

That is, we would like to replace orthogonality with the kernel E by orthogonality 
with a "nearby" subspace F. For this, we apply the following analysis with D = F L . 

Definition 4.1. Suppose that D and E are two closed subspaces in a Hilbert space. 
Then a(D,E), the angle between D and E, is 

a(D,E) = f cos -1 sup (d, e) 

\\d\\=l, deD 



:!(» 
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It is clear that < a(D,E) < tt/2, a(D,E) = a(E,D), and that a(E,D) = tt/2 
if and only if E _L D. We will need slightly more subtle properties stated in the 
following 

Lemma 4.9. Suppose that D and E are two closed subspaces in a Hilbert space. Then 

(4.32) a(D,E) = cos' 1 sup ||P B d|| , a(D, E) = sin -1 inf \\P E ±d\\ . 

\\d\\=i,deD \\d\\=i,d£D 

In addition if E is finite dimensional then 

(4.33) a(D,E) = ^ D n E ^ {0} . 

= 1. By the definition of the projection 



Proof. To see ( |4.32[ ) let d E D, with 
operator, 



1 - \\P E df 



\d — PEd\\ 2 = inf lid — el 

1 ee£ 1 



inf inf (1 — 2a (d, e) + a z 



inf inf lid — aell 2 

eEE aeM 

inf (l-(d,e> 2 ) 

ee_E 



1 — sup (d, e) z 

eS-B 
l|e|I=l 



and consequently, 



\Pe&\\ = sup (d, e) 

eS-E 



from which the first formula in (4.32) follows. The second one is a consequence of the 
first one as 1 = \\P E d\\ 2 + \\P E ±d\\ 2 . 

The ■<= implication in (4.33) is clear. To see the other implication, we observe that 
ifDfl£ , = {0} and E is finite dimensional then 

inf d{y, D) > , 
y&E 

IMI=l 

where d(y, .D) = inf ze £> ||y — z|| is the distance from y to D. This implies that 
0< inf inf lly - d| 2 = inf inf (1 - 2(y, z) + \\z\\ 2 ) 

Hl/Il=l llwll=i 
< inf inf (2 — 2(y, z)) = 2(1 — sup sup (y,z)) 

\\y\\= l H 2 II =1 1^11=111^1=1 

= 2(1 - cosa{D,E)) . 



Thus if £> H £ = {0} then a(D, E) > 0. But that is the implication in (4.33). □ 



In the notation of (4.31), the translation symmetry gives 

a{E a<c , F^c) = F(a, c 2 , at - a 2 ) , 
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where F is a continuous fuction in C x M. We claim that 

(4.34) F(ci, c 2 , a) > k s > for 5 < c x < c x + 5 < c 2 < 6' 1 . 

Consider now the case \a\ — a 2 | < A (where A is chosen large below), and hence ci, 
c 2 , and ai — a 2 vary within a compact set. Thus it suffices to check that a(E ajC , F^ c ) 
is nowhere zero and this amounts to checking E D F x = {0}. 
Suppose the contrary, that is that there exists 

u = zid ai q + z 2 d a2 q G F 1 - . 

Since w| M = da\ A dc\ + da 2 A dc 2 , 

= (u,d~ l d c .q) = 0. 



This proves (4.34). To complete the argument in the case \a\ — a 2 | < A, we need: 



Lemma 4.10. Let E = ker)C, and suppose that G is a subspace such that E _L G 
and the following holds: 

u±(E + G) =^ (K.u,u) > 9\\u\\ 2 L2 . 

Then, for any other subspace F we have 

u±(F + G) => {Ku,u} > 6 sin 2 aiE^F 1 ) \\u\\ 2 L2 . 

Proof. Suppose u _L (F + G) and consider its orthogonal decomposition, u = Peu + u. 
Since E _L G and u J_ G, we have ul (-E + G). Hence, by the hypothesis we have 

(JCu,u) = (JCu,u) > 0\\u\\ 2 L 2 = 9\\P E ±u\\ 2 L 2 . 



An application of (4.32) 



sma(E,F ± )= inf \\P E ±d\\v < ^ L '-"^ 



i" i|m||l2 



concludes the proof. □ 

5. Set-up of the proof 

Recall the definition of T (for given S > and a, c) stated in the introduction. 
Recall 

B{a,c 1 t) A = J b(x,t)q 2 (x,a,c) dx . 

In the next several sections, we establish the key estimates required for the proof of 
the main theorem. Let us assume that on some time interval [0,T], there are C 1 
parameters a(t) G M 2 , c(t) G M 2 such that, if we set 

(5.1) v(;t) d ^u(;t)-q(;a(t),c(t)) 
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then the symplectic orthogonality conditions (3.26) hold. Since u solves (1.1), v(t) 
satisfies 

(5.2) d t v = d x (-d 2 x v - 6q 2 v - 6qv 2 - 2t» 3 + bo) - F 
where F Q results from the perturbation and d t landing on the parameters: 

2 2 

(5.3) F Q = ^{aj - c])d a] q + ^ Cjd c .q - d x {bq) 

3=1 3=1 

Now decompose 

Fo = F\\ + F± 

where F\\ is symplectically parallel to M and F± is symplectically orthogonal to M. 
Explicitly, 

2 2 

(5.4) F\\ = J2&3 ~ c? + ¥c^)d a] q + - \d aj B)d Cj q 

3=1 3=1 
1 2 

(5.5) F ± = -d x (bq) + - J2[-(d Cj B)d a .q + (d a .B)d Cj q] 

3=1 

All implicit constants will depend upon 5q > and L°° norms of bo(x,t) and its 
derivatives. We further assume that 

(5.6) 8 < ci(t) < c 2 {t) - S < 5^ 

holds on all of [0, T]. 

In [|6] we will estimate Fj_ using the properties of q recalled in [|3j We note that 
i 7 !! = would mean that the parameters solve the effective equations of motion (1.4). 
Hence the estimates on F^ are related to the quality of our effective dynamics and 
they are provided in ^7} In ^8] we then construct a correction term which removes the 
leading non- homogeneous terms from the equation for v. Finally energy estimates in 



| based on the coercivity of /C lead to the final bootstrap argument in §10 

6. Estimates on F\ 



Using the identities in Lemma 3.4, we will prove that F± is 0(h ); in fact, we obtain 
more precise information. For notational convenience, we will drop the t dependence 
in b(x,t), and will write b', b", b'", to represent x-derivatives. 



We will use the following consequences of Lemma 3.2 



(6.1) 



d aj q = -d x r)(-,aj,Cj) + S t 
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and 

(6.2) Cjd c .q = d x [(x - aj)i](x, a j: c,-)] + 2c3 ~ 3 °^ 2 — °lLd x r](x, &j, c,-) 

[Ci + c 2 ){cx — c 2 ) 



2cj6(a 2 — a-i ) 
(ci + c 2 )(ci - c 2 



-d^x.a^-j^c^j) +S C 



where 9 is given by (3.6). 

Importantly, as the last formula shows, d Cj q is not localized around aj due to the 
Cj-dependence of hz-j. Also note that it is (x — aj) and not (a; — aj) in the first term 
inside the brackets. 

Definition 6.1. Let A denote the class of functions of a, c that are of the form 

h 2 ip(ai — a 2 , a, c) + q(a, c)h 3 , 

a = (ai, a 2 ) G M 2 , < 5 < ci < c 2 — 5 < 1/5, where 

\did k c cFM*,a,c)\<C(a)- N , \d k c cf a q(a, c)\ < C , 

where C depends on 5, N , I, k, and p only. 

We note that if / G iS err , then J f(x)dx has the form <p(a\ — a 2 , a,c), f E A. The 
most important feature of the class A is that for / G A, 

\d k aj dlj\<h 2 { ai -a 2 )- N + h* 

with implicit constant depending on c\, c 2 . 

Lemma 6.1. We have 

(6.3) d aj B(a, c, •) = 2c j b'(a j ) + A 



71 



2 



2 



(6.4) «9 c .B(a, c, •) = 26(6,-) + 2b'(a j )(a j - a,) - — ^(o^cj 

2(-l)^-j(y(a 2 )-y(ai))6> | ^ 
(ci + c 2 )(ci - c 2 ) 

Proof. First we compute d a .B(a,c,t). We have that 9 a .o is exponentially localized 
around aj. Substituting the Taylor expansion of b around aj, we obtain 

(x - aj)d aj q 2 



d a .B{a,c,t)= b{aj) J da^ + b'ia,) J 

+ f{x-a 3 ) 2 d aj q 2 + 0{h 

= I + II + III + 0(h 3 ) 



3> 
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Terms I and II are straightforward. Using (3.21) and (3.22), 



I = b(aj)d aj j q 2 = 

II = b'(dj) ^d a] J xq 2 - djd aj J q 2 ^ = 2c j b\a j ) 



For III, we will substitute (6.1) and hence pick up 0{K 2 )(ai — a 2 ) errors. 



Ill = —-b"(dj) J (x — dj) 2 d x r) 2 (x, dj, Cj) dx + A = A 



Thus, we obtain (6.3). Next, we compute d Cj B(a,c,t). Note that d Cj q is not localized 
around dj. Begin by rewriting d Cj B as 

d c .B= [ b(a j )d C] q 2 + [ b'(a j )(x-d j )d c .q 2 + [ bjdc.it 1 



where 



bj{x) = b(x) - b{dj) - b'{dj)(x - dj) . 



Now substitute (6.2) into the last term and note that the iS err term in (6.2) produces 
an A term here. 



+ 



+ 



2 [ ~ 

bj(x) d x [(x - a,j)r)(x, dj, Cj)]rj(x, dj, Cj 
c 3 -j6 



Cj(cx + c 2 )(ci - c 2 ) 

e 



(ci + c 2 )(ci - c 2 ) 
= I + II + III + IV + V + A 

where terms I-V are studied separately below. 



bj{x)d x rf(x,dj,Cj) 
bj(x)d x n 2 (x, d 3 -j,c 3 -j) + A 



I = b(dj)d Cj I q 2 = 2b(dj) 
II = b'(dj) ^d Cj J xq 2 - djd c . J q" 
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Term III is localized around dj, and thus we integrate by parts in x and Taylor expand 
1 



bj around dj to obtain 



III 



C 3 



bj(x)(x — + bj(x)j if(x, dj, Cj) 
lb"(dj) 



2 Cj 



(x — dj) 2 r] 2 (x, dj, Cj) 



— b"(dj)(dj — aj) J (x — dj)rf(x, dj, Cj) + 0(h 3 ) 



T V>(dj)cf + 0{h 3 ) 



12 

Term IV is localized around dj, and thus we integrate by parts in x and Taylor expand 
bj around dj to obtain 

J bj(x)d x ri 2 (x,dj,Cj) = - J (b'{x) - b{dj))rj 2 {x, dj, Cj ) 

= -\b"(dj) J(x- dj)r] 2 (x, dj, Cj) + 0(h 3 ) 
= 0(h 3 ) 

Term V is localized around d 3 _j, and thus we integrate by parts in x and Taylor 
expand bj around d 3 ^j. 

J bj(x)d x r] 2 (x,d 3 _ j ,c 3 -j) = - J (b'(x) - b(dj))rj 2 (x, d 3 -j, c 3 _,) 

= -(b'(a 3 - j )-b'(a j )) / rj 2 (x, d 3 -j, c 3 -j) 



- b"(d 3 _j) J( x ~ a 3 -j)r] 2 (x, d 3 _j, c 3 _j) + 0(h 3 ) 
= -2c 3 ^{b\d 3 ^)-b'{dj)) + 0{h 3 ) 

□ 

Lemma 6.2 (estimates on F±). 

(6.5) d; 1 d aj F ± = 0(h 2 ) ■ S sol , d; 1 d Cj F ± = 0(h 2 ) -S sol , j = l,2 



2 

(6.6) F ± = -\ h ^d x r{; dj, c 3 ) + A • <S sol 

1 3=1 °J 

where 
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In light of the above lemma, we introduce the notation F± = (Fj_) + Fj_, where 



(6. 



1 A b" (a 



. . 1 \ U \Clj , A , 



3 =1 J 



and F± E A ■ S soh We make use of (6.5) in §7 and (6.6) in m\9 



Proof. We begin by proving (6.6). By (3.18), (3.19), 



d x {bq) = {d x b)q + b{d x q) 

2 2 

= (9 x b) yV(> - a 3 )d a q + Cjd c q) - b V d a q 

(6.9) 

2 2 

= + (^X* - G i))^<? + ^2(d x b) Cj d Cj q + 0(h 3 ) ■ <S sol 

j=i i=i 

The <9 aj .g term is well localized around cij, and thus we can Taylor expand the coeffi- 
cients around cij. The d c .q term we leave alone for the moment. 
We have d x {bq) = 

2 ^ 

^ — b(aj) + b'{aj)(<ij — a 3 ) + b"(cij){cij — aj)(x — cij) + -b"(cij)(x — cij) 2 ^jd aj q 



+ ^ b'(x)cjd Cj q + A ■ S sol 



Substituting the above together with (6.3) and (6.4) into (5.5), we obtain 



1 2 7T 2 

i=i 



(ci + c 2 )(ci -c 2 ) ^ 



We now substitute (6.1) and (6.2) recognizing that this will only generate errors of 
type A times a Schwartz class function. We also Taylor expand around cij or a^j 
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depending upon the localization. 

1 2 7T 2 

F± = - & "(%) ( - Y^ C J 2 + 2 ( a i - a i)( x ~ a i) + - aj) 2 ^d x r}{x, a h Cj) <- I 



(b'(a 2 ) - V {a^B 
(ci + c 2 )(ci - c 2 ) 



3=1 



— b"(dj)(x — aj)9 x [(x — aj)r)(x, dj, c,-)] 

3=1 



-E 

2 

+E 



^ (ci + c 2 )(c x -c 2 ) 



(ci + c 2 )(ci -c 2 ; 



b"(dj)(x — cij)d x r](x, dj,Cj) 
b"(a 3 „j)(x - a z ^j)d x r){x, d 3 _j, c 3 _j) 
{b'(a 3 -j) - b'(aj))d x r](x, a 3 _ i; c 3 _j) 



~ (ci + c 2 )(ci - c 2 
+ A ■ iSgoi 

We have that IV + V = and II + VI = 0. Hence 
F ± = I + III + A ■ S sol 



<- II 



<- III 



<- IV 



<- V 



<- VI 



1 2 / 2 

- ^'(aj)^ ( (t^ c 7 2 + ( x ~ aj) 2 )v(x, 1 

.7=1 ^ 



This completes the proof of (6.6). To obtain (6.5), we note that a consequence of 
(6.6) is F± = 0(h 2 )f, where / £ <S so i. By the definition (5.5) of F± and Corollary 
3.3, we have d~ 1 F± e «S so i, and hence / G <S so i. □ 



7. Estimates on the parameters 

The equations of motion are recovered (in approximate form) using the symplectic 
orthogonality properties (3.26) of v and the equation ( 5.2[ ) for v. For a function G of 
the form 

G = gid ai q + g 2 d a2 q + g 3 d Cl q + g 4 d C2 q 
with gj = gj(a, c), define 

coef(G) = {gi, g2, 9s, 9i) ■ 

Lemma 7.1. Suppose we are given 5q > and bo(x,t), and parameters a(t), c(t) 
such that v defined by (5.1) satisfies the symplectic orthogonality conditions (3.26). 
Suppose, moreover, that the amplitude separation condition (5.6) holds. Then (with 
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implicit constants depending upon 5 > and L°° norms of b and its derivatives), if 
\\v\\fj2 < 1, then we have 

(7.1) Icoef^l^^lHlHi + IHIlri. 

Proof. Since {v,d~ 1 d aj q) = 0, we have upon substituting (5.2) 
= d t (v,d~ l d aj q) 
= {d t v,d~ l d aj q) + (v^td^d^q) 

= ((d 2 x v + 6q 2 )v, 8 aj q) + ((6qv 2 + 2v 3 ),d a] q) -C— 1 + II 

- (bv, d a ,q) - {F h d^d a] q) - (F ± , d x l d aj q) <- III + IV + V 



vfc=l 



k=l 



We have, by KTTh, 



l=(v,d aj (d 2 x q + 2q 3 )) 
= -\(v,d- 1 d aj d x HM) 

2 

= -{v,dZ ld a^c 2 k d ak q) 



k=l 



Also, by ([5j]) 



ni = -(6u,3 a , 9 ) 
= -(v,d aj (bq)) 
= -(v,d- 1 d aj d x (bq)) 



-(v, d- x l d aj J> Cfc 5)<9 afc(? + | ^(«9 a&J B)«9 Cfe(? ) ) 



fe=i 



fc=i 



Thus 



|i + iii + vi| = + (^VW 

< |M| L2 (/i 2 + |coef(^|)|) 
Next, we note that by Cauchy-Schwarz, 

|H|<|M&. 



Next, observe from (5.4) and Lemma 3.6| that 

W=(F h d- 1 d aj q) = - 
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Of course, we have V = (F±, d~ l d a .q) = 0. Combining, we obtain 
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(7.2) 



Cj d a B 



< |M|tfi(/i 2 + |coef(F||)|) + |M|^ 



A similar calculation, applying d t to the identity = (v, d x 1 d c ,q), yields 



(7.3) 



< IHIffi^ + lcoef^O + IMI^. 



Combining (7.2) and (7.3) gives (7.1). 



□ 



8. Correction term 



Recall the definition (6.7) of r. Let p be the unique function solving 

(l-d 2 x -6 V 2 )p = T, 

see [T9(, Proposition 4.2] for the properties of this equation. The function p is smooth, 
exponentially decaying at oo, and satisfies the symplectic orthogonality conditions 



Set 

and note that 



(P ) ? ?)=0, (p,xi])=0 



p(x, aj,Cj) = f Cj l p(cj(x - dj)) 



Define the symplectic projection operator 



Pf = J2(f,d;%q)d aj q + J2(f,d^d aj q)d Cj q. 



Define 
(8.2) 



3 =i j 



Note that w = 0{h 2 ) and clearly now w satisfies 

(8.3) (w, d~ l d aj q) = , (w, d~ l d C] q) = . 

Recall the definition (6.8) of (F±) . 

Lemma 8.1. If dj = O(h), and hj = Cj — b{dj) + O(h), then 



•4) 



d t w + d x {d 2 x w + 6q 2 w - bw) = -{F x ) -G + A- <S so i . 



where G is an 0(h 2 ) term that is symplectically parallel to M, i.e. 

G e span-j^d^g, d^d^q, d~ l d Cl q, d^d^q] . 
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Proof. Let 



w 



b"{dj) , . , 

3 ^2 P\i a ii C j, 



Then 



d tWj = b'"(d j )d j cj 2 p(-, dj, Cj ) - 2b"( y d j )cj 3 c j p(-, dj, Cj ) 

+ b"(dj)c~ 2 djd aj p(-, dj, cj) + b"(dj)cj 2 Cjd Cj p(-, dj, cj) + d t b" \dj)cj 2 p(- , dj, Cj) 
= -djd x Wj + A ■ S so i 



Also, we have 



(&l + 6q 2 )vjj = (dl + Qi] 2 (-, dj, Cj))wj + A ■ <S so i 



c 2 Wj — b"(dj)Cj 2 r(-, dj, Cj) + A ■ S t 



sol 



Also, 



bwj = b(dj)wj + A ■ S so \ 



Combining, we obtain 



d t Wj + d x (d 2 Wj + 6q 2 Wj — bwj) 

= -b"(dj)c~ 2 d x r(-, dj,Cj) + {-dj + c 2 - b(dj))d x Wj + A ■ S so \ 

= -b"(dj)c~ 2 d x r(-, dj,Cj) + A ■ <S so i 

Now we discuss d t Pwj. 

d t Pwj = (d t Wj, d^d^qjd^q + (vjj, d t d x l d ai q)d C] q + similar 

+ (wj, d~ l d ai q)d t d Cl q + similar 

The first line of terms is symplectically parallel to M. For the second line, note that 
by (8.1), we have (wj,d~ 1 d ai q) = A. Consequently, 

d t PWj = T q M + A ■ S so i 



□ 



Define u and v by 
5.5) 



u = u + w , v = v + w. 



Of course, it follows that u = q + v. Note that by (3.26) and (8.3), we have 

(8.6) (v, d~ l d aj q) = and (v,%%q) = 0, j = 1,2. 
Note that u solves 

(8.7) d t u = -d x {d 2 x u + 2u 3 - bu) - d t w - d x (d 2 x w + 6u 2 w - bw) + 0(h 4 ) 

where the 0(h 4 ) terms arise from w 2 and w 3 . Moreover, if we make the mild assump- 
tion that v = 0(h), then u 2 w = q 2 w + 0(h 3 ). By fl8?F| ) and we have 

(8.8) d t u = -d x (d 2 x u + 2u 3 - bu) + (F±) + G + A ■ S sol 
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Since u = q + v , we have (in analogy with (5.2)) 

(8.9) d t v = d x {-dlv - Qq 2 v + bv) - F\\ - F± + G + A ■ S sol + O^H 1 

where we have made the assumption that v = 0(h 3 ^ 2 ) in order to discard the v 2 and 
v 3 terms. We thus see that, in comparison to v, the equation for v has a lower-order 
inhomogeneity, but still satisfies the symplectic orthogonality conditions (8.6) and 
v = v + 0(h 2 ). 



9. Energy estimate 

Since w = 0(h 2 ), to obtain the desired bound on v it will suffice to obtain a bound 
for v. This will be achieved by the "energy method." 

Lemma 9.1. Suppose we are given 5q > and bo(x,t), and parameters a(t), c(t) 
such that v defined by (5.1) satisfies the symplectic orthogonality conditions (3.26) 
on [0,T]. Suppose, moreover, that the amplitude separation condition (5.6) holds on 
[0, T]. Then (with implicit constants depending upon <5 > and L°° norms of bo and 
its derivatives), if IHIh 2 < 1 and T h~ l , then 

2 



\l~ T] m< H0)\\ 2 m 



h 4 (^l + J ( ai -a 2 )- N dt^ 



Proof. Recall that we have defined 

H c {u) = h{u) + (c? + c 2 2 )h{u) + c\c 2 2 h{u) . 



With w given by (8.2) and u given by (8.5), let 



Then 



£{t) = H c {u) - H c (q) . 

d t £ = (H' c (u),d t u) - (H' c (q),d t q) + 2{c lCl + c 2 c 2 ){I 3 (u) - J 3 (g)) 
+ 2c 1 c 2 (c 1 c 2 + tic 2 )(h(u) - h(q)) 
= I + II + 111 + IV 



Note that II = since Lemma 4.1 showed that H' c (q) = 0. For III, we have by (3.17) 
and the orthogonality conditions (8.6), 

III = 2(c a c! + c 2 c 2 )((I' 3 (q), v) + 0(\\v\\m)) 

2 

= 4(dCi + c^ij^cp-'da^v) + + |c 2 |)||t>||^) 

= 0{{\c x \ + \c 2 \)\\vf m ) 
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Term IV is bounded similarly. It remains to study Term I. Writing (8.8) as d t u 
ld x I' 3 {u) + d x {bu) + {F ± ) 



G + A ■ S so \ and appealing to (2.5), we have by Lemma 
2.1| (with u replaced by u in that lemma) that 

I = (H' c (u), d x (bu)) + (H' c (u), (Fjo + A ■ S sol ) 
= 5{b x ,A 5 (u)) - 5(b xxx , A 3 (u)) + (b xxxxx , A^u)) 

+ (c? + 4)(3(b x , A 3 (u)) - (b xxx , + clc 2 2 (b x , A 1 {u)) 

+ (H' c (u),(F ± ) + A-S aol ) 

Expand Aj(u) = Aj{q + v) = Aj{q) + Aj(q)(v) + 0{v 2 ) and H' c (u) = H' c {q) + K, c>a v + 
0{v 2 ) = K CA v + 0{v 2 ) to obtain I = IA + IB + IC, where 

IA = 5(b x , A 5 {q)) - 5(b xxx , A 3 (q)) + (b 

XXXXXl AM) 

+ (c? + cl)(3(b x , A 3 (q)) - (b xxx , A^q))) + c\c\{b Xl A 1 {q)) 
IB = 5(b x ,A' 5 {q)(v)) - Hb xxx ,A' 3 (q)(v)) + (b xxxxx ,A[(q)(v)) 

+ (cl + cl)(3(b x ,A' 3 (q)(v)) - (b xxx ,A[(q)(v)))+clc 2 2 (b x ,A[(q)(v)) 
IC = (JC c>a v, (F ± ) ) + 0{h\\vf H A + 0{A ■ \\v\\ m ) 
Then reapply Lemma 2.1| (with u replaced by q in that lemma) to obtain that IA = 



— (H' c (q) } d x (bq)} = 0. Applying Lemma 2.2 

IB = ()C c>a v,(bq) x ) - (d x H' c (q)M 
= (/C C)0 t>, (bq) x ) 

In summary thus far, we have obtained that 

d t £ = (JC c , a v, (bq) x + (F ± ) ) + 0{h\\v\\\.) + 0(A\\v\\ H2 ) 



By (4.12), (4.13), and (8.6) (recalling the definition (5.3) of F ), we obtain 
(K, Cja v,d x (bq)) = -(K, C7a v,F ) = -(K. c>a v,F\\ + F±) 

Hence 

d t £ = -(Kcav, F\\ + F ± ) + 0(h\\v\\ 2 H A + 0(A\\v\\w) 
It follows from Lemma 



7.1 



and F± G A ■ S soi (see Q) that 

\dt£\ < (h 2 ( ai - a 2 )- N + h 3 )\\v\\ H > + h\\v\\ 2 H2 



If T 



5h-\ 
£{T) = £(0) 



h 2 1 + / (a! - a 2 ) 



-iV 







ML 00 H 2 + II rao rr2 



By Lemma |4.1| the definition of £ and /C Cja , and the fact that u = q + v , we have 

|3 

\H 2 ■ 



\£-(ICc,aV,v)\<\\v\\ 3 : 
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Applying this at time and T, together with the coercivity of /C (Proposition 4.3) 
MT)\\h £ \WQ)\\ 2 H* + h 2 (l+ [ (oi - a 2 )- N ) « + h\\v\\ 2 „ 2 . 



o 



Replacing T by T 1 such that < T" < T, and taking the supremum in T' over 



< T' < T, we obtain 



M\l™ T] m £ 11^(0)11^2 + h 2 (l + (ai - a 2 ) \\v\\ l ~ t] hI + h\\v\\l^ )T] m • 
By selecting S small enough, we obtain 

\Ml~ T]H > < \\v(0)\\h +h*(l + j\a x - a 2 )~ N dty 
Finally, using that ||iu ||#2 ~ h 2 , and v = v + w, we obtained the claimed estimate. □ 

10. Proof of the main theorem 

We start with the proposition which links the ODE analysis with the estimates on 
the error term v: 

Proposition 10.1. Suppose we are given 6 £ C^°(1R 2 ) and So > 0. (Implicit con- 
stants below depend only on bo and So). Suppose that we are further given a £ M 2 , 



c £ Mr\C, k > 1, h > 0, and vq satisfying (3.26), such that 



< h < k 1 , \\vo\\m < Kh 2 



Letu(t) be the solution to (1.1) withb(x,t) = b (hx,ht) and initial data r/(-, a, c) + v . 



Then there exist a time T' > and trajectories a(t) and c(t) defined on [0,T'] such 



that a(0) = a, c(0) = c and the following holds, with v = u — r](-, a, c): 



(1) On [0, T'\, the orthogonality conditions (3.26) hold. 

(2) Either ci(T') = S , ci(T') = c 2 (T') - 5 , c 2 (T') = 5 \ or T' = w/r 1 , where 
u < 1. 

(3) |d i -t^ + 6(a i ,t)| < fc. 

(4) l^-c,^)!^^ 2 . 

(5) |M|f°° H 2 < anh 2 , where a 1. 

i/ere a and a; are constants depending only on bo and So ( independent of k, etc ) 
Proof. Recall our convention that implicit constants depend only on o an d S. By 



Lemma 3.7 and the continuity of the flow u(t) in H 2 , there exists some T" > on 



which a(t), c(t) can be defined so that (3.26) hold. Now take T" to be the maximal 



time on which a(t), c(t) can be defined so that (3.26) holds. Let T' be first time 
< V < T" such that ci(T') = S , ci(T') = c 2 (T) - S , c 2 (T') = V = T", or 
uh^ 1 (whichever comes first). Here, < u <C 1 is a constant that we will chosen 
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suitably small at the end of the proof (depending only upon implicit constants in the 
estimates, and hence only on b and S). 

Remark 10.2. We will show that on [0, T'], we have ||v(i)||#2 < nh 2 , and hence 
by Lemma 3.7 and the continuity of the u(t) flow, it must be the case that either 
d(T') = S , ci(T') = c 2 (T') - 6 , c 2 (T') = S^ 1 , or uhr 1 (i.e. the case T = T" does 
not arise). 

Let T, < T < T", be the maximal time such that 



:io.i^ 



lull roo m < aK h , 



where a is suitably large constant related to the implicit constants in the estimates 
(and thus dependent only upon bo and So > 0). 



Remark 10.3. We will show, assuming that (10.1) holds, that || < ^anh 1 ' 2 

and thus by continuity we must have T = T'. 

In the remainder of the proof, we work on the time interval [0, T], and we are able 
to assume that the orthogonality conditions ( |3.26[ ) hold, So < c i(t) < c 2 (t) — So < S , 
and that ( 10.1 ) holds. By Lemma 7.1 and Taylor expansion, we have (since K 2 h 4 < h 2 ) 

(10.2) 



c 2 -b(a 3 ,t) + 0(h) 



dj = Cjd x b(a,j,t) + 0(h 



with initial data Oj(0) 



J ! 



9(0) 



a,-. Let 



e(t) 



dcf 



b{ ai {t),t)-b{a 2 {t),t) 



a>i(t) - a 2 (t) 

and let denote an antiderivative. By the mean- value theorem |£| < /i, and since 
T < uh" 1 , we have e~ ~ 1. We then have 

j t (e E (a 2 - ax)) = e H (c^ - c?) + O(h) . 

Since 5q < c 2 , — c 2 , we see that e = (a 2 — ai) is strictly increasing. Let < t% < T 
denote the unique time at which e = (a 2 — a x ) = (if the quantity is always positive, take 
t\ = 0, and if the quantity is always negative, take t% = T, and make straightforward 
modifications to the argument below). If t < ti, integrating from t to t% we obtain 



S 2 o(ti-t)<-e m (a 2 (t)- ai (t)) 
If t > ti, integrating from t\ to t we obtain 



i m \a 2 (t) -ax(t)\ 



< 



a 2 (t) - ai{t)) . 



Hence, 



(a 2 (t) - ai (t))- 2 < 1. 
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By Lemma[9.1[ we conclude that 



OL 

\v\\ L¥ Hi<j(M0)\\H> + h 2 



< -anli 2 
~ 2 



□ 



We can now complete 



Proof of the main Theorem. Suppose that ||fo||_ff 2 < h 2 . Iterate Prop. 10.1. as long 
as the condition 



(10.3) 



So < ci < c 2 - S < S 1 



remains true, as follows: for the k-th iterate, put k = a k in Prop. 10.1 and advance 
from time tk = kuh' 1 to time tk+\ = (k + Vjuoh^ 1 . At time tk, we have ||v(tfc)||#2 < 
a k h 2 , and we find from Prop. 10.1 that |M|l°° h 2 < a k+1 h 2 . Provided (10.3) 



holds on all of [0, we can continue until k 1 ~ h, i.e. K ~ \ogh 1 . 

Recall ( flT5] ), and Aj (T) , Cj (T) defined by ([TTTJ. Let a^t) = h^A^ht), Cj(t) 
Cj(ht). Then dj, tj solve 

a2 



6(dj, t) 
Cjd x b(dj,t) 



with initial data &j(0) = Cj(0) = Cj. We know that (10.3) holds for Cj on [0, h 1 T( 



Let a, 



cij — dj, Cj = Cj — dj denote the differences. Let 



7(0 



def 



b(a,j,t) — b(dj, t) 



Qj j CL rj 



a(t) 



def 



d x b(a,j,t) - d x b(dj,t) 



cij dj 



By the mean-value theorem, |7(i)| < h and \c(t)\ < /i 2 . We have 

■j = Cj + ^CjCj — 7cij + 0(h) 



(10.4) 



Cj{d x b)(dj ) t) + CjCraj + (9(/i 2 ) . 



We conclude that |a,-| < e c?ti and |c,-| < he Cht . This is proved by Gronwall's method 



and a bootstrap argument. Since (10.3) holds for dj on [0,/i _1 T ], it holds for on 
the same time scale if T < oo, and up to the maximum time allowable by the above 
iteration argument, eh -1 log h" 1 , if T = +oo. □ 
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Appendix A. Local and global well-posedness 
In this appendix, we will prove that (1.1) is globally well-posed in H k , k > 1 



provided 



(A.l) M(T) \\dib(x,t)\\L~L T < oo. 



=0 



for all T > 0. This is proved for k = 1 under the additional assumption that 
IHUiij? < 00 m ^ ne appendix of Dejak-Sigal [11]. |*] The removal of the assump- 
tion ||6||l| L oo < oo is convenient since it allows for us to consider potentials that 
asymptotically in x converge to a nonzero number, rather than decay. Moreover, our 
argument is self-contained. 

Well-posedness for KdV (nonlinearity d x u 2 ) with 6 = was obtained by Bona-Smith 
[5] via the energy method, using the vanishing viscosity technique for construction 
and a regularization argument for uniqueness. Although their argument adapts to 



include 6^0 and to mKdV (1.1), it applies only for k > | due to the derivative 
in the nonlinearity. Kenig-Ponce-Vega j2B EH] reduced the regularity requirements 
(for b = 0) below k = 1 by introducing new local smoothing and maximal function 
estimates and applying the contraction method. These estimates were obtained by 
Fourier analysis (Plancherel's theorem, van der Corput lemma). At the H 1 level of 
regularity (and above) for mKdV, the full strength of the maximal function estimate 
in J2TJ [20] is not needed. Here, we prove a local smoothing estimate and a (weak) 



maximal function estimate (see (A. 2) and (A. 3) in Lemma A.l below) instead by the 



integrating factor method, which easily accomodates the inclusion of a potential term 
since integration by parts can be applied. The estimates proved by Kenig-Ponce-Vega 
were directly applied by Dejak-Sigal, treating the potential term as a perturbation, 
which required introducing the norm ||&||l2 L °o. Our argument does not apply directly 
to KdV since we are lacking the (strong) maximal function estimate used by 



Let Q n = [n — |, n + |] so that 1R = UQ n . Let Q n = [n — 1, n + 1]. An example of 
our notation is: 

||w||/oor2r2 = SUP 1 1 1 1 r 2 r2 . 

We will use variants like £^L^LQ n etc. Note that due to the finite incidence of overlap, 
we have 



4t is further assumed in [TT] that is small, although this appears to be unnecessary in 

their argument. 
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Theorem A.l (local well-posedness) . Take fceZ, k > 1. Suppose that 

k+1 

M d = f ^||^,t)|| W? <oo. 

For any R > 1, take 

T< mm(M-\R- 4 ). 

(1) If 1 1 Uq 1 1 jjk < R, there exists a solution u(t) G C([Q,T];H£) to (1.1) on [0,T] 
with initial data Uq(x) satisfying 

IMIl§?it* + \\d k £ +1 u\\^LlLl n ^R- 

(2) This solution u(t) is unique among all solutions in C([0, T]; H£). 

(3) The data-to- solution map u H- u{t) is continuous as a mapping H k — > 
C([0,T};H k x ). 



The main tool in the proof of Theorem A.l is the local smoothing estimate (A.2) 
below. 

Lemma A.l. Suppose that 

vt + Vxxx ~ (bv) x = f . 

We have, for 

t< (l + iiML^ + ii&iu^r 1 , 

the energy and local smoothing estimates 



( A - 2 ) IMU§?£2 + IHIc^e-L ~ IKIUi + 



and the maximal function estimate 



Id-'fl 



'1/2 r 2 



(A3) \HtiL- L i n < IkolUi + T^ii^L^+T 1 / 2 !!/!!^. 

The implicit constants are independent of b. 

Proof. Let ip(x) = — tan _1 (x — n), and set w(x,t) = e v ^ x 'v(x,t). Note that < e"? < 
e ¥>O0 < < oo, so the inclusion of this factor is harmless in the estimates, although 
has the benefit of generating the "local smoothing" term in ( A.2[ ). We have 

d t w+w xxx -3tp'w xx +3(-tp" + (if') 2 )w x + (-(f" / +3tp"ip' -(if') 3 )w-(bw) x +if'bw = e 9 f . 

This equation and manipulations based on integration by parts show that 

d t \\w\\ Ll = 6&',wl) - 3<(V' + &) 2 )',w 2 ) + 2(- ¥ >"' + W - (^')> 2 > 
-(b x ,w 2 )+2(b^',w 2 ) + 2(w,e^f). 
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We integrate the above identity over [0, T], move the smoothing term 6 / Q T (ip', w 2 x ) x dt 
over to the left side, and estimate the remaining terms to obtain: 



\w(T)\\l 2 +6\\(x- n)-^ 



L 2 T Ll 









f 


J e^fw dx 


dt 


to 







Replacing T by T", and taking the supremum over T" G [0, T], we obtain, for T < 
(1 + IIMi-iff + WHl^l^)' 1 , the estimate 











J e^fw dx 


dt 


Jo 







HIl-l? + W-n) w x \\ L 2 Ll < \\w \\ L 2 + 



Using that < e _7r / 2 < e v < e^ 2 < oo, this estimate can be converted back to an 
estimate for v. 



I L^L\ 



T Qn x 



e 2(p fv dx 



dt. 



Estimating as 



e 2ip fv dx 



dt < 



and then taking the supremum in n yields the second bound in (A. 2). Estimating 
instead as: 



^fv dx 



dt 



< 



< 



e 2 *{d x d~ x f)vdx dt 
{d~ 1 f)d x {e 2tp v)dx dt 



£ II^Vll 



r2 j2 

T Q; 



\{dx)v\\ L 2 L 2 

± Wr, 



and taking the supremum in n yields the second bound in (A.2). 

For the estimate (A. 3), we take i[)(x) = 1 on [n — |,n+|] and outside [n— + 



set w = ipv, and compute, similarly to the above, 



r oo j 2 

T Q, 



<IKHi, +T\\V X \\ 2 L 2 TL 2 +T\\f\\\2 2 

Qn T Qn T Q. 



The proof is completed by summing in n. 



□ 



Proof of Theorem \A.l\ We prove the existence by contraction in the space X, where 

x = i u I \\ u \\c{[o,nHi) + \\d!; +1 u\U??LiL%„ + sup \\dy\\ e?iL¥L 2 Q ^ <CR}. 



a<k-l 
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Here C is just chosen large enough to exceed the implicit constant in (A. 2). Given 
u £ X, let (f(u) denote the solution to 

(A.4) d t <p(u) + d 3 Mu) - d x {hp{u)) = -2d x (u 3 ) . 



with initial condition y?(w)(0) = Uq. A fixed point (p{u) = u in X will solve (1.1). We 
separately treat the case k = 1 for clarity of exposition. 



Case k = 1. Applying d x to (A.4) gives, with v = (p(u) x , 

v t + v xxx - (bv) x = -2(u 3 ) xx + (b x Lp(u)) x . 



Now, (A.2) gives 
(A.5) 



\W{u) x \\l™li + \\(p{u) xx \\e™tf T Ll n ^ 



Using that \\u\\ L 



2 < 



NHlfl + \\(u 3 ) x \\ e i nL 2 TL 2 Qn + || {b x ip{u)) x \\ L 1 tL 2 

\ u \\l 2 - + II^zIIlOIMIl 2 - > we a l so have 



) x \\lI ^ IMIr» NIl°° ^ ll^xlUa ||m|| L |(||u|| L 2 + \\u x \\ L 2_) 
Taking the L\ norm and applying the Holder inequality, we obtain 



ll(« 8 ).| 



LlL 2 Q H \\u x \\l™L 2 q \\ u \\l°°I?q(\\ u \\l*L* q + \\ u x\\l4,L 2 Q ) ■ 



Taking the l\ norm and applying the Holder inequality again yields 

IIC" 3 )x|Ul2Z2 < IKxHe-iS?^ \ U \llL™L\ (II^IU? r2 ^2 + 11^11^2,2^2 



J T Qr, 



Using the straightforward bounds HmJI^l^l 2 ^ ||w x ||i,«=z, 2 , 



\ u \\llLlL\ ^ |M 



and 



we obtain 



\M\P n L* T L* 



< 



\Ux\\L* T Ll 



<T^\\ 



<T^\\ 



u \\L^Ll 



U x \\L™Ll 



\\(u 3 ) x \\ e i nL 2 TL 2 Qr 



^ Tl/2 \H\l^Hl\\ U hiLS?L% 



Inserting these bounds into (A.5), 



(A.6) \\<P(U) x \\ L oo L 2 + \\ip(u) xx \\ e ^ L 2 TL 2 Qn < \\U \\ H 1 +T 1/2 ||M||^ H l||M|| 4L? c i 2^ 



+ T(||6 a ,||ji« + ||&aw|U«>)||¥>(*0 



The local smoothing estimate (A.2) applied to v — <p(u) (not v = (f(u) x as above), 
and the estimate 



\\(u 3 ) x \ 



LipL 2 



< 



T\\u\ 



provides the estimate 
(A.7) 



Mu)\\ 



< 
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The maximal function estimate (A. 3) applied to v — (f(u) and the estimate 



iu%\\ L% Li<T^\\u\\* 



give the estimate 



(A. 



\W(y)\\t? n L~Li n < IKIUi + Ty( u )\\ L oo H i + t\\u\\1 ¥H i 



Summing (A.6), (A.7), (A. 8), we obtain that ||<^(w)||x < CR if \\u\\ x < CR provided 



T is as stated above. Thus (p : X — > X. A similar argument establishes that (p is a 
contraction on X. 



Case k > 2. Differentiating (A. 4) k times with respect to x we obtain, with v 
d t v + d 3 x v - d x {bv) = -2<t + V) - 2d x 9 * b d *v{u) ■ 



a+/3<k+l 
/3<k-l 



Using (A. 2) gives 



\d«ip{ U )\\ L ~ L 2 + \\d* +1 <p{ U )\U L 2r2 



< 



\\d k x u 3 \\ W Ll + sup \\d x {dtbd^{u))\\ L}rLl . 

P<k-1 



Expanding, and applying Leibniz rule gives 



a</3< 7 



which is then estimated as follows 
\\d k x u\\ llL%Llin < 



|9>|| W ^J|9>|| Wi g>J|a>|| WL 2 Qn . 



a</3<-y 



By the Sobolev embedding theorem (as in the k = 1 case) we obtain 



fc„.3| 



\dtu 



< 



V ( sup ||<9>|| 4L oo L 2 J ( sup ||a>| 



02 t2 j2 



a+/3+7=fc 
a</9<7 



\&lu\ 



T oc r 2 
1j t ±j x 



When k > 2, we have a < [[hk]] < k — 2 and (3 < [[■kk]] < k — 1, and therefore 



\\d k x u 3 \\^ T Ll <T 1,2 ( sup \\d[ 

T Qn \a<k-l 



: U \\t*L~I? Qr 



\U\ 



Also, 



\\d x {d«b d^M 



< T 



sup 

a<fc+l 



|¥>WlU§?fl* 
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Combining these estimates, we obtain 
(A.9) 



poo j 2 j 2 5^ 



PO if* 



&(«)|U~Lg + R* + V(«)il, 

+ T 1/2 ( sup ||<9>|| £ 2 i?L 2 j \\u\\* Hk +T ( sup ll^&H^oaigc ) y(u)\\ L¥H h 

\a<k-l / T \a<fc+l / 



The local smoothing || (w 3 ) x ||i,i l 2 ~ ^1Mli°°H-i to obtain 



(A.10) 



llv(«)ll 



7" oo r 2 



<T||d 



We apply the maximal function estimate (A. 3) to v — d"if{u) for a < k — 1 and use 
that ||d£ +1 w 3 || L i L 2 < THull 3 ^^ and 



to obtain 
(A.11) 



|^ +1 (^(«))|| L ^ < T ( SUp H^llxgpiy ) 11^)11^^ 

vJ 8<A 



+ T ( sup 



T oo j oo 



Summing (A.9), (A. 10), (A. 11), we obtain that <p : X — > X, and a similar argument 



shows that ip is a contraction. This concludes the case k >2. 

To establish uniqueness within the broader class of solutions belonging merely to 



C([0, T]; H^), we argue as follows. Suppose u, v G C([0, T]; H],) solve (1.1). By (A. 3) 

\\v\\p n L?°Ll n < Ikoll^ + T|k||L-ffi + T\\vf L ^ Hl . 
By taking T small enough in terms of ||t> we have that 
(A.12) 
Similarly, 
(A.13) 



W\\e2 nL¥L 2 < \\u\\ L??H i 



Set w = u — v. Then, with g = (u 3 — v 3 ) /(u — v) = u 2 + uv + v 2 , we have 

w t + w xxx - (bw) x ± (ffw)^ = . 



Apply (A. 2) to v = w x to obtain 
(A. 14) 



w x \\l™li + \\w xx \\^ L 2t2 < \\(gw) x \\n L 2 L 2 + \\(b x w) 



The terms of || (gw) x \\ni L 2 L 2 are bounded following the method used above: 



< 



K\\^L ¥ ^ Qn (bw\\ WL X Qn + \\{vw) x \\ elnL 2 TL i Qn ) 
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The term in parentheses is bounded by 

f «2r2 r2 UroarJ + 17™ L>2 r 2 r 2 \\w (2roor2 + W j2foof2 \\W T \\ c2 T 2 r 2 

II \\(. n l^ T ±jQ n || M^i/j, iyQ^ II J- \\t n L, T L,Q n II U^-Lly ilg^ II \\L n ±J T L,Q n \\ X I I ^Tl^T Q n 

which leads to the bound 

(A.15) |KHki4£ 2 Qn < ^ 1/2 II^II^5?^ (II^IU-^ ll^ll^^^^^ H- H^IUa^i,^^ H^H^-j^i) 
We now allow implicit constants to depend upon ||«||i,o?ffi and || z, 00 ^ 1 • Appealing 



to (A. 14), (A.15) (and analogous estimates for other terms in gw), (A. 12), (A. 13) to 



obtain 



\ w \\l%hi < T 1/2 (||w||^ L oo L ^ + \\w\\ L ^ H i) 



Combining this estimate with the maximal function estimate (A. 3 ) applied to w yields 



IMkL5?L Qn < T 1/2 \\w\\ L cp H i + T\\g\\ L cp H i\\w\\ L ™ H i . 

This gives w = for T sufficiently small. The continuity of the data-to-solution map 
is proved using similar arguments. □ 

Next, we prove global well-posedness in H k by proving a priori bounds. Theorem 



A.l shows that doing it suffices for global well-posedness 



Theorem A. 2 (global well-posedness). Fix k > 1 and suppose M(T) < oo for all 
T > 0, where M(T) is defined in (A.l). Foru^ G H k , there is a unique global solution 



u £ C/oc([0, +oo); H k ) to (1.1) with \\u\\ L ao H k controlled by ||w ||#fc ; T, and M(T). 



Proof. Before beginning, we note that by the Gagliaro-Nirenberg inequality, ||w||^ 4 
I Ml la 1 1 Wa;|| £2, we have (in the focusing case) 



< 



u 



With a 



u 



x\\ L 2j 



u 



L 2 



x\\L 2 



and (3 



\\u x || \\u 

IM/Wv 



h < h{u) < \\u x \\i 2 



\ L 2, 



this is a — a 1 / 2 < (3 < a, which implies 



1 6 



that (a) ~ (/?}, i.e. 

\\u x \\ 2 L 2 + \\U\\ L 2 ~ j 3 1 , 
The same statement holds in the defocusing case. 



h(u) 



u. 



L 2 



-i J (u) = (r 1 ( u ),d t u) 



Another fact we need is based on the 
d 
~dt 

= (I'j(u), -u xxx - 2(u z ) x + (bu) x ) 

= </;(«), d x i' 3 ( u )) + (i'^u), {b U ) x ) 

= <J}(tt),(6u) SB > 

For u(t) G L 2 , we compute near conservation of momentum and energy from Lemma 



ED 



d 
dt 



h{u) = (b x ,A 1 (u)) 
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Estimate \(b x , Ax(u))\ < \\b x \\L^h{u), and apply Gronwall to obtain a bound on 
H m IIl°°l 2 i n terms of ||& x ||l°°l°° and ||mo||l 2 - F° r u (t) £ H 1 , we compute near conser- 



vation of energy from Lemma 2.1 

d 



dt 



I 3 (u) = 3{b x , A 3 (u)) - {b xxx , A^u)} . 



We have 



\(b x ,A 3 (u))\ < ||6x|U»(||Wa:||ia + \\u\\i<) 

< IIMU^GKHl 2 + ||tta||L3||M||ia) 

< ||6x||L»(|K||ia + IHlia) 



< \\b x \\ L °°(h(u) 



\u\ 



L 2 



and 



Combining these gives 
d 



dt 



:h(u) 



\{b xxx ,Ai(u))\ < || ^xzzIIl 00 |I m IIl2 . 
iS \\b x \\L°°h(u) + \\b x \\L°°\\u\\ 6 L 2 + \\b xxx \\L°° \\u\\ 2 L 2 



Gronwall's inequality, combined with the previous bound on ||w||l 2 > gives the bound 
on I 3 (u) and hence ||w||#i. 



For u(t) e H 2 , we apply Lemma 2.1 to obtain 
d 



dt 



I 5 (u) = (I' 5 (u),(bu) x ) 



We have 



Also, 
and 



= 5(b x ,A 5 (u)) -5(b xxx ,A 3 (u)) + (b xxxxx , A^u)) 

\{b x ,A 5 (u)}\ < \\b x \\ L °°(\\u xx \\ 2 L 2 + \\u\\ 4 H x + \\u\\ 6 H i) 

< \\b x \\L°°h(u) + \\b x \\ L °°(\\u\\%i + \\u\\%i) 

\(b xxx ,A 3 (u))\ < \\b xxx \\ L °°(\\u\\ 2 H i + \\u\\ 4 H i) 
\{b XXXXX) Ai{u))\ < \\b xxx \\ L oa\\{u 2 ) xx \\ L 2 < \\b xxx \\ L oo \\u\\ H 2\\u\\ L 2 



Combining, applying Gronwall's inequality, and appealing to the bound on ||m||//i 
obtained previously, we obtain the claimed a priori bound in the case k = 2. 

Bounds on H k for k > 3 can be obtained by the above method appealing to higher- 



order analogues of the identities in Lemma 2.1 However, starting with k = 3, we do 



not need such refined information. By direct computation from (1.1), 

d 
It 



\d k x u\\ 2 L 2 



d k+ \bu) d k x ii + 2 J d k+l u 3 d k x u 
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In the Leibniz expansion of d k+1 u 3 , we isolate two cases: 



d k+1 u 3 = 3u 2 d k+1 u + c «M d > d x u 3 



lu 



For the first term, 



■ft 



2 dt +1 udt 



ft 



a+/3+~f=k+l 
«</3<7<fc 



y)Muf 



< HmII^iim 



m\\ a \\H k 



By the Holder's inequality and interpolation, ifa + /3 + 7 = fc + l and 7 < k, 

||<9> d^u 3T x u\\ L 2 < \\u\\ 2 H2 \\u\\ H k 

Thus we have 



d k+1 u 3 d k u 



<r 11 n2 11 ||2 



Similarly, we can bound 



d k x +1 {bu)d k x u 



M{t)\\u 



I 2 



by separately considering the term bd k+1 ud k u and integrating by parts. We obtain 

d 

dt' 

and can apply the Gronwall inequality to obtain the desired a priori bound. □ 



:ll« 2 



<(M+ 11^)11** 



Appendix B. Comments about the effective ODEs 

Here we make some comments about the differential equations for the parameters 
a and c. 

B.l. Conditions on T . First we give a reason for replacing T Q (h) in the definition 



of T{h) (1.5) by T defined by (1.12). In (10.2) we have seen that the a and c solving 



the system (|1.4|) give the following equations for A = ha, C = c, T = ht: 

A(0) = ah, C(0) = c, j = 1,2 



d T A 3 =C 2 -b (A 3 ,T) + O(h) 



d T C, = C 3 d x b (A v T) + O(h) 



This can also be seen by analysing (B.6) using Lemma 3.2 

As in (10.4) we can write the equations for Aj — Aj and Cj — Cf 



d T {A 



A,) = (Cj - C 3 Y + 2Cj(Cj - Cj) + 7o (A ? - - Aj) + O(h) 



d T (Cj - Cj) = (Cj - Cj)(d x b )(Aj,t) + Cja (Aj - Aj) + 0(h) 
Aj(0) - Aj(0) = , Cj(0) - (7,(0) = , 
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where 7o,o"o = 0(1). This implies that 

MT) - MT) = 0(h)e CT , 
CjiT) - Cj(T) = 0(h)e CT . 

This means that for T < 8\og(l/h), we have Cj (T) = Cj(T) + 0(h}- 5C ). Hence, if 5 



is small enough, then for small h we have that T (h) defined in (1.5) and T in (1.12) 
can be interchanged. 



B.2. Examples with Cj going to 0. In the decoupled equations (1.11 ) we can have 



00 



which implies that Tq < 00 in the definition (1.12). That prevents \og(l/h)/h lifespan 



of the approximation (1.3). 
Let us put 



a = Aj , c = C 



j ■ 



so that the system (1.11) becomes 

a' T = c 2 (T) - b (a,T) , c' T = cd a b (a,T) . 



(B.l) 



For simplicity we consider the case of bo(a, T) = bo{a)- In that case the Hamiltonian 

E(a, c) = -^c 3 + cb (a) 
is conserved in the evolution and we have 

(B.2) exp(Tmin<9 a &) < \c(T)\ < exp(Tmax<9 a &) . 

In particular this means that c > 5 > if T < Ti(S). 



We cannot improve on (B.2), and in general we may have 

\c(T)\ < e" 7T , T — > 00 , 

but this behaviour is rare. First we note that the conservation of E shows that if 
c(Tj) — y for some sequence 7} — > 00, then E = 0. We can then solve for c, and the 
equation reduces to da/dT = 26 (a), c 2 = 3bo(a), that is to 

(B.3) *f°W) =T ' 6W0))>0 ' 



'Hi 



If bn(a) > in this set of values a then 



(B.4) 

and c(T) = (36o(o(T)))a. 



a(T) — )• 00 , T — )• 00 
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If b (a) = for some a > a(0) (a' T = 2b > 0), then we denote ax, the smallest 
such a and assume that the order of vanishing of bo there is l\. The analysis of (B.3) 
shows that 

r Ke~ jT e 1 = i, 

a(T) = a x + 0(1) I 

which gives the rate of decay of c(T). 

Hence we have shown the following statement which is almost as long to state as 
to prove: 



Lemma B.l. Suppose that in (B.l) bo = bo(a). Then 

E^O, |c(0)| > 5 > => 35 > VT > 0, \c(T)\ > 5 . 
If E = 0, let 

a\ = min{a : a > a(0) , b (a) = 0} , 

with ai not defined if the set is empty (note that c(0) ^ and E = imply that 
b o (a(0)) > 0). Now suppose that ai exists, and that 

d e bo( ai ) = , £ < it , d £l b ( ai ) ^ . 

Then as T — > oo, 

( Ke~^ T e x = l, 

\c(T)\ < 

( KT-Wi-V l y > 1 , 

for some constants 7 and K, and a(T) — > ax- 

If ax does not exist then c(T) = (3bo(a(T)))z , a(T) — > 00, T — > 00. 

We excluded the case of infinite order of vanishing since it is very special from our 
point of view. 

The lemma suggests that c — > is highly nongeneric but it can occur for our system. 
Since for the original time t in (1.1) we would like to go up to time 5log(l/h)/h we 
cannot do it in some cases as then 

( W 5 ' 2 £i = l, 

c{t)\ t =81og(l/h)/h ~ { 

{ log-^-^l//*) £x>l. 

B.3. Avoided crossing for the effective equations of motion. Here we make 
some comments about the puzzling avoided crossing which needs further investigation. 
For the decoupled equations it is easy to find examples in which 

(B.5) cx(T ) = c 2 (T ) . 

One is shown in Fig|6j We take 60 independent of T and equal to cos 2 x. If we 
choose the initial conditions so that c 2 = 3 cos 2 as in (|1.11[), and 
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h=0.1, (A2,c2) 
h=0.1, (A1,c1) 
h=0.3, (A2,c2) 
h=0.3, (A1,c1) 
decoupled 



Figure 6. The plots of (Aj,Cj), j = 1,2, solving ( |B.6[ ) for for 
bo(x,t) = cos 2 x and initial data Ai(0) = — vr/3, A 2 (0) = 7r/6, and 
Ci(0) = a/3 cos(7r/3), c 2 (0) = y/3 cos(7r/6). The "decoupled" curve cor- 



responds to solving (1.11). Because of the choice of initial conditions, 

[Aj, Cj), j 



1,2 line on the same curve. 



-7r/2 < A 1 < -A 2 < 0, then when A X {T Q ) = -y4 2 (T ) we have ( ]B.5[ ) (this also 
provides an example of c 2 (T) -)• as T -> oo). 

The decoupled equations (1.11) should be compared the rescaled version of (1.4): 

(B.6) 



d TCj = d x B (c, A, h) , d T Aj = c]- d c B (c, A, h) 



B (c,A, h) 



def 



q 2 (x/h, c, A/h)b (x)dx . 



For the example above the comparison between the solutions of the decoupled h- 
independent equations and solutions to the equation (B.6) are shown in Fig|6] (the 
solutions (1.11) are shown as a single curve which both solutions with these initial 
data follow). 
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Figure 7. The plots of q 2 (x, c, A/h) for (A j} Cj), j = 1, 2, solving ( |B.6[ ) 
for for bo(x,t) = cos 2 x and initial data v4i(0) = — n/3, A 2 (0) = 7r/6, 
and Ci(0) = a/3 cos(7r/3), c 2 (0) = \/3cos(7r/6). On the left h = 0.1 and 
on the right h = 0.3. 



The dramatic avoided crossings shown in Fig|6] (and also, for a different, time 
dependent bo in Fig|3]) are not seen in the behaviour of q 2 (x, c, A/h) which is the 
approximation of the solution to (1.1) - see Figj7| The masses of the right and left 



solitons are switched and that corresponds to the switch of positions of A\ and A 2 . 
It is possible that a different parametrization of double solitons would resolve this 
problem. Another possibility is to study the decomposition (3.11) in the proof of 



Lemma 3.2 uniformly a — > (corresponding to a 2 — gl\ — > 0). 

We conclude with two heuristic observations. If the decoupled equations lead to 



(B.5) and \A\ — A 2 \ > e > (which is the case when we approach the crossing in 



Fig|6]) then equations (B.6) differ from (1.11) by terms of size 



h log 



c 2 - Ci 
Cl + c 2 



see Lemma 3.2 For this to affect the motion of trajectories on finite time scales in T 
we need 



(B.7) 



c 2 - Ci 



exp 



This means that c/s have to get exponentially close to each other (but does not 
explain avoided crossing). 

On the other hand if \ai — a 2 \ > e > 0, where a,'s are the original variables in (1.4), 



Aj(0) = hdj(0), then we can use the decomposition in Lemma 3.2 and variables aj 
defined by ( |3.7 ). The remark after the proof of Lemma 3.6 shows that the equations 
of motion take essentially the same form written in terms of a/s and c/s and hence 
hj has to stay bounded. And that means that c 2 — c\ is bounded away from 0. Hence, 
when c 2 — c\ — > we must also have a 2 — a\ — > as seen in Fig|3] and Fig|6j 



EFFECTIVE DYNAMICS OF DOUBLE SOLITONS FOR PERTURBED MKDV 



65 



Appendix C. Alternative proof of Lemma 4.7 (with Bernd Sturmfels) 

We note that the standard substition reduces the equation P(c)u = 0, where P(c) 
is defined in (4.20), to an equation with rational coefficients: 

z = tanhx , d x = (1 — z 2 )d z , rj 2 = 1 — z 2 . 

This means that P(c)u = is equivalent to Q(c)v = 0, u(x) = f(tanhx), where 

Q(c) = (L 2 + 1)(L 2 + c 2 ) - 10LR(z)L + 10(3R(z) - 2R(z) 2 ) - 6(1 + c 2 )R(z) , 

and 



1 



(l-z 2 )d z , R(z) = l-z 2 , -1<z<1 



Lemma 4.7 will follow from finding a basis of solutions of Q(c)v = and from 
seeing that the only bounded solution is the one corresponding to d x rj, that is, to 



viz) 



Z(l-Z 2 )z 



Remarkably, and no doubt because of some deeper underlying structure due to com- 
plete integrability, this can be achieved using MAPLE package DEtools. 
First, the operator Q(c) is brought to a convenient form 

Q = ( z - + l) 4 ^-J(z) + 12z(z - lf{z + If^f(z) 

+ (z-l) 2 (z + l) 2 (26z 2 -c 2 + l)^/(z) 

- 2z(z -l)(z+ l)(8z 2 - 11 + c 2 )^f(z) 

+ (4 - 20z 2 + 6c 2 z 2 - 5c 2 + 162 2 )/(z) 

Applying the MAPLE command DFactorsols(Q,f (z)) gives the following explicit 
basis of solutions to Q(c)v — 0, c ^ 1: 



vx{z) 


= (i-z 2 : 


2 z , 


v 2 {z) 


= (! + «)• 


-2(1 - Z ) £ 2((c + Z) 2 + Z 2 - 1), 




= v 2 (-z) 


= (i + z)?(z - iyH(c - zf + z 2 - 1) , 


v 4 (z) 




~U 3zc 2 + 3z 3 c 2 7z 3 + 7z) log Z + 1 
v ' z — 1 




+ (i- 


. z 2 )-\ (4 C 2 _ 6cV + lAz 2 - 12) . 



For c ^ 1 these solutions are linearly independent and only v\ vanishes at z = ±1 (or 



is bounded). Hence ker^2 P(c) is one dimensional proving Lemma 4.7 



(>(:> 
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